Skip to main content
simplebench
Projects
simple_bench_comp_dataset
Objects
simple_bench_competition_hard_set-evaluation
XRvBx4XMm2kXAhbwosntYqchh4ZYu5TUGH0dS5XRDeU
Log in
Sign up
Project
Traces
Evals
Playground
Monitors
Leaders
Threads
Assets
Assets
All assets
Prompts
Ops
Models
Datasets
Scorers
simple_bench_competition_hard_set-evaluation:v0
Name
simple_bench_competition_hard_set-evaluation
(1 version)
Last updated
8 months ago
Last updated by
Jonas Zabel
Storage size
998B
Leaderboard
Values
Use
Calls
simple_bench_competition_hard_set:v0
eval_multi_choice:v0
Summary
1
Model
2
true_count
3
true_fraction
4
Avg. Latency
5
Run Date
6
Trials
7
LiteLLMModel:v1
1.00
10.00%
33.04
8 months ago
1.00
LiteLLMModel:v2
0.00
0.00%
167.26
8 months ago
1.00
LiteLLMModel:v6
1.00
10.00%
29.63
8 months ago
1.00