Skip to main content
wandbot
Projects
wandbot-dev
Objects
Evaluation
Y2MX7N9QXXmXTjRjwKmkDsTWCnB3Xv1eRGTQXEVEXRQ
Log in
Sign up
Project
Models
Workspace
Runs
More
Weave
Traces
Evals
Playground
Monitors
Assets
More
Assets
All assets
Prompts
Ops
Models
Datasets
Scorers
Evaluation:v0
Name
Evaluation
(5 versions)
Last updated
1 year ago
Storage size
0B (922B from all versions)
Leaderboard
Values
Use
Calls
Dataset:v0
Summary
get_answer_correctness:v0
1
Model
2
Run Date
3
Trials
4
Avg. Latency
5
answer_correctness.true_count
6
answer_correctness.true_fraction
7
EvaluatorModel:v0
1 year ago
1.00
39.60
76.00
77.55%