Skip to main content

Evaluation Comparison Report - test

Comparing evaluations
Created on February 7|Last edited on February 7

volcanic-violet-123 Run set hellaswag/accvolcanic-violet-123 Run set hellaswag/acc_norm_stderrsuper-donkey-122 Run set hellaswag/accsuper-donkey-122 Run set hellaswag/acc_norm_stderr0.000.100.200.300.40
meta
11s
13s
summary
_wandb
8
9
evaluation
table-file
table-file
4.09629
4.23927
1707206095.37606
1707205000.7024
table-file
table-file
table-file
table-file
Run set
1
Run set
1