Zjowowen's workspace
Runs
1
Name
1 visualized
State
Notes
User
Tags
Created
Runtime
Sweep
adv_max
adv_mean
approx_kl
clipfrac
cur_lr
entropy_loss
env step
episode return mean
episode return std
policy_loss
train iter
value_loss
value_max
value_mean
Finished
-
zjowowen
4h 45m 53s
-
2.04089
-4.9671e-10
0.18095
0.40677
0.0003
1.29073
4998400
-1505.93567
172.49503
0.013037
749760
0.4937
-25.43181
-27.81519
1-1
of 1