simple_bench_comp_dataset Workspace – Weights & Biases

Skip to main content

Assets

simple_bench_public-evaluation:v0

Name

simple_bench_public-evaluation(1 version)

Last updated

8 months ago

Last updated by

Storage size

970B

simple_bench_public:v0

eval_multi_choice:v0

Summary

1

Model

2

true_count

3

true_fraction

4

Avg. Latency

5

Run Date

6

Trials

7

LiteLLMModel:v7

2.00

20.00%

45.86

8 months ago

1.00

LiteLLMModel:v6

2.00

20.00%

34.29

8 months ago

1.00

Total Rows: 2