Zjowowen's workspace
Runs
6
State
Notes
User
Tags
Created
Runtime
Sweep
cur_lr
entropy_loss
env step
episode return max
episode return mean
episode return min
episode return std
policy_loss
total_loss
train iter
value_loss
Finished
-
zjowowen
15h 10m 3s
-
0.0006
1.64279
19483215
21
21
21
0
0.0063362
-0.0076959
38002
0.0047915
Failed
-
zjowowen
13h 26m 28s
-
0.0006
1.67391
19479758
20
20
20
0
0.010389
-0.0041359
38002
0.0044293
Failed
-
zjowowen
14h 15m 2s
-
0.0006
1.68584
19483204
21
21
21
0
0.00087419
-0.0144
38002
0.0031679
Failed
-
zjowowen
13h 47m 6s
-
0.0006
1.6104
19483837
21
21
21
0
0.011585
-0.00081234
38002
0.0074135
Failed
-
zjowowen
22s
-
-
-
-
-
-
-
-
-
-
-
-
Failed
-
zjowowen
28s
-
-
-
-
-
-
-
-
-
-
-
-
1-6
of 6