Zjowowen's workspace
Runs
9
State
Notes
User
Tags
Created
Runtime
Sweep
adv_max
adv_mean
approx_kl
clipfrac
cur_lr
entropy_loss
env step
episode return mean
episode return std
policy_loss
train iter
value_loss
value_max
value_mean
Finished
-
zjowowen
4h 43m 43s
-
3.8275
-2.3618e-9
0.033177
0.046563
0.0003
1.34668
9990904
-21
0
-0.0037195
312100
0.0083168
2.08818
1.23288
Finished
-
zjowowen
4h 59m 33s
-
3.41552
-2.0787e-9
0.007647
0.013594
0.0003
1.46036
9990786
20
0
-0.0013468
312100
0.005658
2.03882
1.10553
Finished
-
zjowowen
4s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Finished
-
zjowowen
6h 45m 41s
-
3.2381
-2.1011e-9
0.019929
0.014281
0.0003
1.42477
9991159
20
0
-0.00088263
312100
0.0016906
2.02805
1.18655
Finished
-
zjowowen
6h 30m 46s
-
4.53342
-5.4464e-9
0.009223
0.014094
0.0003
1.49031
9991500
20
0
-0.0020601
312100
0.00032922
1.81346
1.19578
Failed
-
zjowowen
4h 57m 14s
-
4.41132
8.9183e-9
0.000815
0.0065
0.0003
1.51139
9991386
20
0
-0.0013078
312100
0.0099915
2.28891
1.47454
Crashed
-
zjowowen
3h 43m 2s
-
3.9769
-1.5981e-9
0.043008
0.15741
0.0003
0.95951
6854088
21
0
-0.023808
214100
0.0092767
2.15597
1.33713
Finished
-
zjowowen
6h 15m 42s
-
3.39289
-3.3528e-9
0.011706
0.036313
0.0003
1.42696
9991340
20
0
-0.0050302
312100
0.013981
2.06176
1.19555
Finished
-
zjowowen
6h 8m 45s
-
3.84976
-2.1607e-9
0.063147
0.096469
0.0003
0.97342
9991349
-2
-
-0.011577
312100
0.03653
1.93893
0.71499
1-9
of 9