simple_bench_public-evaluation:v0
Name: simple_bench_public-evaluation (1 version)
Last updated: 8 months ago
Last updated by: Jonas Zabel
Storage size: 970 B
Leaderboard
simple_bench_public:v0
eval_multi_choice:v0
Summary
Model            true_count  true_fraction  Avg. Latency  Run Date      Trials
LiteLLMModel:v7  2.00        20.00%         45.86         8 months ago  1.00
LiteLLMModel:v6  2.00        20.00%         34.29         8 months ago  1.00
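
The true_count and true_fraction values reported for each model can be related with a little arithmetic. The sketch below assumes that true_fraction is true_count divided by the number of scored examples; the page does not state this definition, and the function names are hypothetical. Under that assumption, the reported 2.00 and 20.00% would imply 10 scored examples per run, which is an inference rather than a figure shown on the page.

```python
def true_fraction(true_count: float, n_examples: int) -> float:
    """Fraction of examples scored True (assumed definition: count / total)."""
    if n_examples <= 0:
        raise ValueError("n_examples must be positive")
    return true_count / n_examples


def implied_examples(true_count: float, fraction: float) -> int:
    """Number of scored examples implied by a count and a fraction."""
    if fraction <= 0:
        raise ValueError("fraction must be positive")
    return round(true_count / fraction)


# Values reported for both LiteLLMModel:v7 and LiteLLMModel:v6.
n = implied_examples(2.0, 0.20)
print(n)
print(f"{true_fraction(2.0, n):.2%}")
```

Both models share the same true_count and true_fraction, so under this reading they differ only in average latency (45.86 versus 34.29).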