ai-hacker-cup-benchmark Workspace – Weights & Biases

Skip to main content

Assets

Evaluation:v1

Name

Evaluation(2 versions)

Last updated

1 year ago

Storage size

0B (0B from all versions)

Summary

1

Model

2

solution_passed.true_count

3

solution_passed.true_fraction

4

Avg. Latency

5

Run Date

6

Trials

7

ReflectionSolver:v0

13.00

52.00%

330.35

1 year ago

5.00

ReflectionSolver:v1

13.00

52.00%

95.36

1 year ago

5.00

OneShotSolver:v6

5.00

20.00%

58.95

1 year ago

5.00

OneShotSolver:v7

9.00

36.00%

220.68

1 year ago

5.00

Total Rows: 4