Zjowowen's workspace
Runs
1
Name
1 visualized
State
Notes
User
Tags
Created
Runtime
Sweep
adv_max
adv_mean
approx_kl
clipfrac
cur_lr
entropy_loss
env step
episode return mean
episode return std
policy_loss
train iter
value_loss
value_max
value_mean
Finished
-
zjowowen
4h 23m 36s
-
2.30737
-3.7253e-9
0.12468
0.47292
0.0003
4.9591
3995242
-1119.85315
223.97223
0.046936
596760
83.29633
-11.14634
-45.85158
1-1
of 1