Zjowowen's workspace
Runs: 1

Name:
State: Finished
Notes: -
User: zjowowen
Tags:
Created:
Runtime: 2h 52m 41s
Sweep: -
adv_max: 3.86491
adv_mean: -3.8892e-9
approx_kl: 0.025813
clipfrac: 0.26781
cur_lr: 0.0003
entropy_loss: 9.09391
env step: 9991791
episode return mean: 2877.20459
episode return std: 866.46191
policy_loss: -0.016857
train iter: 312100
value_loss: 1.09971
value_max: 43.14992
value_mean: 38.3904