Zjowowen's workspace
Runs
5
State
Notes
User
Tags
Created
Runtime
Sweep
adv_max
adv_mean
approx_kl
clipfrac
cur_lr
entropy_loss
env step
episode return mean
episode return std
policy_loss
train iter
value_loss
value_max
value_mean
Finished
-
zjowowen
1h 46m 2s
-
4.12273
-2.9132e-9
0.025768
0.27344
0.0003
22.23952
4997506
2049.69604
500.21573
-0.0018366
156100
7.00079
44.05851
34.53643
Failed
-
zjowowen
1h 27m 18s
-
3.42796
2.0117e-10
0.036527
0.33125
0.0003
17.39586
4997559
4689.6875
43.62648
-0.0023642
156100
9.11319
41.23037
32.33515
Failed
-
zjowowen
1h 29m 37s
-
3.61571
-5.0813e-9
0.041234
0.29725
0.0003
18.83321
4997535
5219.72656
22.5792
-0.0015635
156100
2.0334
44.47774
36.69352
Finished
-
zjowowen
4s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Finished
-
zjowowen
5s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
1-5
of 5