Aurick-qiao's workspace
Runs
8
Name
8 visualized
State
Notes
User
Tags
Created
Runtime
Sweep
ARC-C 0 shot
ARC-C 25 shot
ARC-E 0 shot
BoolQ 0 shot
COPA 0 shot
DROP 3 shot
GSM8K 5 shot
HellaSwag 0 shot
HellaSwag 10 shot
HumanEval pass@1 (t=0.01)
HumanEval pass@1 (t=0.2)
HumanEval pass@1 (t=0.8)
HumanEval pass@10 (t=0.2)
HumanEval pass@10 (t=0.8)
MBPP pass@1 (t=0.01)
MBPP pass@1 (t=0.1)
MBPP pass@1 (t=0.8)
MBPP pass@10 (t=0.1)
MBPP pass@10 (t=0.8)
MMLU 0 shot
MMLU 5 shot
Openbook QA 0 shot
PIQA 0 shot
RACE 0 shot
Step
Truthful QA 0 shot
Winogrande 0 shot
Winogrande 5 shot
cpp pass@1, t=0.01
cpp pass@10, t=0.8
cs pass@1, t=0.01
cs pass@10, t=0.8
d pass@1, t=0.01
d pass@10, t=0.8
go pass@1, t=0.01
go pass@10, t=0.8
java pass@1, t=0.01
java pass@10, t=0.8
jl pass@1, t=0.01
jl pass@10, t=0.8
js pass@1, t=0.01
js pass@10, t=0.8
loss
lua pass@1, t=0.01
Finished
-
richard-fan
4s
-
37.543
42.833
64.731
66.636
86
3.467
2.123
69.647
71.619
-
7.439
-
14.153
-
-
8.92
-
18.655
-
28.047
25.721
39.6
75.843
38.565
-
38.958
63.141
64.799
6.832
-
3.165
-
3.205
-
53.896
-
3.165
-
5.66
-
9.938
-
-
3.106
Finished
-
richard-fan
5s
-
42.577
47.44
70.749
74.434
83
4.39
12.358
72.894
74.378
-
23.902
-
36.336
-
-
30.988
26.825
40.162
58.621
42.456
48.423
41.2
78.074
38.182
-
36.468
67.009
68.824
24.224
-
17.089
-
6.41
-
74.675
-
22.785
-
20.755
-
29.193
-
-
22.981
Finished
-
richard-fan
5s
-
44.625
51.706
70.328
82.783
85
36.167
28.052
73.312
76.12
31.707
34.116
31.037
50.36
65.755
39.4
39.112
33.775
45.911
59.895
52.789
53.215
42
77.856
41.148
-
47.29
68.114
70.639
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Finished
-
richard-fan
5s
-
38.908
47.014
67.34
72.355
80
4.437
10.387
70.345
71.968
28.049
28.384
21.067
40.019
52.764
37.4
36.375
23.05
46.287
56.372
42.325
48.775
39.8
76.768
38.182
-
35.908
65.509
67.403
23.602
22.617
17.089
32.179
7.051
13.566
70.13
96.228
27.215
37.146
24.528
28.383
29.814
48.125
-
23.602
Finished
-
richard-fan
6s
-
42.833
50.085
68.519
80.703
88
34.662
27.748
70.823
72.774
31.707
34.085
33.902
49.183
69.273
41
39.9
32.062
46.084
61.672
51.188
52.727
38
76.442
40.861
-
44.974
68.114
68.35
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Finished
-
richard-fan
9s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
1.76562
-
Finished
-
richard-fan
20s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
1.14844
-
Finished
-
richard-fan
16s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
0.72266
-
1-8
of 8