Zjowowen's workspace
Runs
2
State
Notes
User
Tags
Created
Runtime
Sweep
adv_abs_max
cur_lr
entropy_loss
env step
episode return mean
grad_norm
policy_loss
total_loss
train iter
value_loss
Finished
-
zjowowen
29m 5s
-
4.53529
0.0003
4.65941
902421
1004.91345
53.35189
-0.067184
1.09228
7001
2.50531
Finished
-
zjowowen
29m 19s
-
3.11507
0.0003
4.80501
900803
183.76953
877.20538
0.15564
148.42065
7001
296.72223
1-2
of 2