simple_bench_comp_dataset Workspace – Weights & Biases

simple_bench_competition_hard_set-evaluation:v0

Name: simple_bench_competition_hard_set-evaluation (1 version)
Last updated: 8 months ago
Last updated by:
Storage size: 998 B

simple_bench_competition_hard_set:v0

eval_multi_choice:v0

Summary

Model            true_count  true_fraction  Avg. Latency  Run Date      Trials
LiteLLMModel:v1  1.00        10.00%         33.04         8 months ago  1.00
LiteLLMModel:v2  0.00        0.00%          167.26        8 months ago  1.00
LiteLLMModel:v6  1.00        10.00%         29.63         8 months ago  1.00

Total Rows: 3
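
The relationship between true_count and true_fraction in the summary can be checked with a little arithmetic. The sketch below is illustrative only: it assumes true_fraction is simply true_count divided by the number of scored examples, which the page does not state. The names SummaryRow, implied_example_count, and summarize are hypothetical and are not part of any Weights & Biases API. The page gives no unit for Avg. Latency, so the code treats it as an opaque number.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class SummaryRow:
    """One row of the summary table (hypothetical container, not a W&B type)."""
    model: str
    true_count: float
    true_fraction: float  # stored as a fraction: 0.10 means 10.00%
    avg_latency: float    # unit not stated on the page


# Values copied from the summary table.
ROWS = [
    SummaryRow("LiteLLMModel:v1", 1.0, 0.10, 33.04),
    SummaryRow("LiteLLMModel:v2", 0.0, 0.00, 167.26),
    SummaryRow("LiteLLMModel:v6", 1.0, 0.10, 29.63),
]


def implied_example_count(row: SummaryRow) -> Optional[int]:
    """Infer how many examples were scored, assuming
    true_fraction == true_count / n. Returns None when the
    fraction is zero, because n cannot be recovered then."""
    if row.true_fraction == 0:
        return None
    return round(row.true_count / row.true_fraction)


def summarize(correct_flags: List[bool]) -> Tuple[int, float]:
    """Compute (true_count, true_fraction) from per-example results,
    under the same assumption about how the fraction is defined."""
    true_count = sum(correct_flags)
    return true_count, true_count / len(correct_flags)


if __name__ == "__main__":
    for row in ROWS:
        print(row.model, implied_example_count(row))
```

Under that assumption, the v1 and v6 rows are each consistent with one correct answer out of ten scored examples. The v2 row, with a zero fraction, does not determine the example count on its own.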