Skip to main content
simplebench
Projects
simple_bench_public
Traces
Log in
Sign up
Overview
Traces
Evals
Playground
Monitors
Leaders
Threads
Assets
Traces
All Ops
Filter
Past 1mo
inputs
output
eval_multi_choice
model_latency
Trace
Feedback
Status
model
self
true_count
true_fraction
mean
User
Called
Tokens
Cost
eval-2025-08-12-calm-wind
d644
LiteLLMModel:v4841
Evaluation:v48
N/A
N/A
0.7221
2 weeks ago
0
$0.0000
eval-2025-08-12-fierce-cloud
c370
LiteLLMModel:v4841
Evaluation:v48
N/A
N/A
0.7217
2 weeks ago
0
$0.0000
eval-2025-08-09-fierce-tiger
f57d
LiteLLMModel:v4840
Evaluation:v48
1
0.1
11.175
2 weeks ago
0
$0.0000
Evaluation.evaluate
2e5b
LiteLLMModel:v4839
Evaluation:v47
N/A
N/A
N/A
3 weeks ago
0
$0.0000
eval-2025-08-05-keen-bird
5bb0
LiteLLMModel:v4841
Evaluation:v48
N/A
N/A
N/A
3 weeks ago
0
$0.0000
eval-2025-08-01-calm-lake
f76f
LiteLLMModel:v4838
Evaluation:v48
2
0.2
4.0414
4 weeks ago
0
$0.0000
eval-2025-08-01-tender-dolphin
8bd5
LiteLLMModel:v4837
Evaluation:v48
N/A
N/A
1.4192
4 weeks ago
0
$0.0000