Gyr679's workspace
Runs
56
Name
12 visualized
Tags
GSM8K
chain
train
GSM8K
chain
evaluation
GSM8K
chain
train
GSM8K
chain
evaluation
GSM8K
chain
evaluation
GSM8K
chain
train
GSM8K
chain
evaluation
GSM8K
chain
train
GSM8K
chain
train
GSM8K
chain
train
GSM8K
chain
evaluation
GSM8K
chain
train
GSM8K
chain
train
GSM8K
chain
evaluation
GSM8K
chain
train
GSM8K
chain
train
GSM8K
chain
evaluation
GSM8K
chain
train
GSM8K
chain
train
GSM8K
chain
train
Created
Runtime
End Time
ID
Notes
Updated
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_420/validation__0__1/correct_frac
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_420/validation__0__1/exact_match
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_420/validation__0__1/exact_match_frac
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_420/validation__0__1/majority_vote_acc
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_420/validation__0__1/none_answer_extracted_frac_per_problem
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_420/validation__0__1/once_hit
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_420/validation__0__1/unique_answer_count
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_430/validation__0__1/correct_frac
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_430/validation__0__1/exact_match
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_430/validation__0__1/exact_match_frac
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_430/validation__0__1/majority_vote_acc
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_430/validation__0__1/none_answer_extracted_frac_per_problem
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_430/validation__0__1/once_hit
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_430/validation__0__1/unique_answer_count
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_440/validation__0__1/correct_frac
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_440/validation__0__1/exact_match
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_440/validation__0__1/exact_match_frac
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_440/validation__0__1/majority_vote_acc
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_440/validation__0__1/none_answer_extracted_frac_per_problem
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_440/validation__0__1/once_hit
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_440/validation__0__1/unique_answer_count
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_450/validation__0__1/correct_frac
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_450/validation__0__1/exact_match
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_450/validation__0__1/exact_match_frac
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_450/validation__0__1/majority_vote_acc
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_450/validation__0__1/none_answer_extracted_frac_per_problem
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_450/validation__0__1/once_hit
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_450/validation__0__1/unique_answer_count
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_460/validation__0__1/correct_frac
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_460/validation__0__1/exact_match
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_460/validation__0__1/exact_match_frac
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_460/validation__0__1/majority_vote_acc
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_460/validation__0__1/none_answer_extracted_frac_per_problem
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_460/validation__0__1/once_hit
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_460/validation__0__1/unique_answer_count
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_470/validation__0__1/correct_frac
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_470/validation__0__1/exact_match
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_470/validation__0__1/exact_match_frac
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_470/validation__0__1/majority_vote_acc
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_470/validation__0__1/none_answer_extracted_frac_per_problem
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_470/validation__0__1/once_hit
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_470/validation__0__1/unique_answer_count
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_480/validation__0__1/correct_frac
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_480/validation__0__1/exact_match
analysis/TaskPerformanceAnalyzer/gsm8k_validation/ckpt--iter_480/validation__0__1/exact_match_frac
4d 5h 56m 49s
Apr 20 '25 12:38
yc23440y
-
Apr 21 '25 12:53
0.68649
0.90885
0.68649
0.76139
0.0011729
0.90885
3.09651
0.68398
0.89276
0.68398
0.76408
0.0016756
0.89276
3.02413
0.6875
0.89544
0.6875
0.75603
0.00050268
0.89544
2.94638
0.69353
0.90617
0.69353
0.76944
0.00033512
0.90617
2.94102
0.68884
0.9008
0.68884
0.75335
0.0010054
0.9008
2.96783
0.69889
0.90349
0.69889
0.7748
0.0008378
0.90349
2.95979
0.69638
0.90617
0.69638
4h 38m 26s
Apr 01 '25 19:15
2bwa662b
-
Apr 15 '25 06:27
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
3d 8h 29m 57s
Apr 01 '25 13:14
ijgrqg01
-
Apr 15 '25 06:27
0.62584
0.89276
0.62584
0.74263
0.0013405
0.89276
4.31635
0.61277
0.89008
0.61277
0.72386
0.002681
0.89008
4.36997
0.61947
0.8874
0.61947
0.73458
0.0008378
0.8874
4.19035
0.61327
0.87936
0.61327
0.71314
0.0010054
0.87936
4.07775
0.61294
0.87936
0.61294
0.71582
0.00033512
0.87936
4.19035
0.62366
0.8874
0.62366
0.73995
0.0011729
0.8874
4.13137
0.62735
0.89008
0.62735
3h 45m 28s
Feb 23 '25 13:58
wm5m6g45
-
Apr 15 '25 02:55
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
5h 19m 38s
Feb 22 '25 13:52
hvupmenp
-
Apr 15 '25 02:55
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
1d 59m 41s
Feb 23 '25 09:06
q0mserei
-
Apr 15 '25 02:55
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
0.65985
0.89008
0.65985
4h 47m 57s
Feb 21 '25 18:04
lg3j2dp3
-
Apr 15 '25 03:00
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
1d 1h 2m 37s
Feb 21 '25 14:30
3vx2vtts
-
Apr 15 '25 02:55
0.65499
0.8874
0.65499
0.71582
0.00033512
0.8874
3.12601
0.65516
0.87668
0.65516
0.70777
0.00033512
0.87668
2.93566
0.66488
0.88204
0.66488
0.72922
0.00016756
0.88204
2.87936
0.65952
0.87936
0.65952
0.7185
0
0.87936
2.94638
0.67745
0.9008
0.67745
0.76408
0
0.9008
2.91153
0.66019
0.89812
0.66019
0.71046
0.00033512
0.89812
2.89276
-
-
-
1d 9h 58m 16s
Feb 21 '25 13:15
qifc7jim
-
Apr 15 '25 03:00
0.687
0.9008
0.687
0.76139
0.00016756
0.9008
2.95979
0.67594
0.8874
0.67594
0.73727
0
0.8874
3.06971
0.67108
0.89276
0.67108
0.74263
0
0.89276
2.97051
0.67158
0.87399
0.67158
0.73995
0
0.87399
3.06971
0.67946
0.89008
0.67946
0.73995
0.00016756
0.89008
2.9866
0.68314
0.89544
0.68314
0.7319
0.00033512
0.89544
2.85523
0.68716
0.89812
0.68716
2d 5h 29m 50s
Feb 22 '25 08:31
soxnjsgx
-
Apr 15 '25 02:55
0.68582
0.87131
0.68582
0.73727
0.00016756
0.87131
2.77748
0.68851
0.87936
0.68851
0.75067
0.00067024
0.87936
2.68633
0.68264
0.86863
0.68264
0.72654
0.00050268
0.86863
2.70777
0.68398
0.88472
0.68398
0.72922
0
0.88472
2.6193
0.68817
0.87668
0.68817
0.73458
0.00016756
0.87668
2.58177
0.68968
0.88204
0.68968
0.72654
0
0.88204
2.64075
0.69688
0.88204
0.69688
1h 31m 46s
Feb 20 '25 04:17
xbsn1w1n
-
Apr 15 '25 02:55
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
3h 29m 38s
Feb 19 '25 17:20
7m1mbw2l
-
Apr 15 '25 02:56
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
3h 30m 37s
Feb 19 '25 17:20
y2wfrk4i
-
Apr 15 '25 03:00
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
3h 30m 22s
Feb 19 '25 17:20
ccnx91az
-
Apr 15 '25 02:56
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
2h 15m 3s
Feb 19 '25 12:36
511a6793
-
Apr 15 '25 02:56
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
1d 6h 1m 11s
Feb 20 '25 12:50
wi94ugrj
-
Apr 15 '25 02:56
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
6h 33m 15s
Feb 19 '25 12:35
srwjai27
-
Apr 15 '25 02:56
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
11d 2h 41m
Mar 01 '25 11:24
hghqe3lk
-
Apr 15 '25 03:00
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
1d 4h 2m 27s
Feb 19 '25 10:20
7xb80udn
-
Apr 15 '25 02:56
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
23h 42m 45s
Feb 19 '25 06:00
v7r0tnaj
-
Apr 15 '25 02:56
0.63304
0.88204
0.63304
0.71582
0.00050268
0.88204
3.99196
0.63137
0.86327
0.63137
0.70777
0.00033512
0.86327
3.91153
0.63773
0.87399
0.63773
0.7185
0
0.87399
3.83914
0.62601
0.87668
0.62601
0.71046
0.00067024
0.87668
3.82306
0.6193
0.87399
0.6193
0.69437
0.0008378
0.87399
3.91957
0.61411
0.86863
0.61411
0.69705
0.00033512
0.86863
3.89276
0.61595
0.85791
0.61595
1-20
of 26