Zjowowen's workspace
Runs (7)
| State | Notes | User | Tags | Created | Runtime | Sweep | cur_lr | env step | episode return max | episode return mean | episode return min | episode return std | q_value | target_q_value | total_loss | train iter |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Crashed | - | zjowowen | | | 39m 39s | - | 0.0001 | 1036008 | 485 | 485 | 485 | 0 | 4.97767 | 4.95118 | 2.7189 | 100010 |
| Crashed | - | zjowowen | | | 3h 20s | - | 0.0001 | 5227905 | 400 | 400 | 400 | 0 | 5.05246 | 5.05755 | 2.54289 | 504010 |
| Crashed | - | zjowowen | | | 2h 20m 1s | - | 0.0001 | 4065429 | 515 | 515 | 515 | 0 | 4.9763 | 5.01393 | 2.53254 | 392010 |
| Crashed | - | zjowowen | | | 5h 24m 32s | - | 0.0001 | 11833527 | 0 | 205 | 0 | 0 | 5.68941 | 5.72315 | 2.50059 | 1140010 |
| Finished | - | zjowowen | | | 14h 19m 54s | - | 0.0001 | 19984140 | - | 745 | - | - | 6.14153 | 6.14756 | 2.3261 | 1996010 |
| Finished | - | zjowowen | | | 17h 50m 47s | - | 0.0001 | 19981592 | - | 3585 | - | - | 5.96009 | 5.97767 | 2.36297 | 1996010 |
| Failed | - | zjowowen | | | 14h 44m 13s | - | 0.0001 | 19982105 | - | 1590 | - | - | 6.13117 | 6.11411 | 2.32432 | 1996010 |