Zjowowen's workspace
Runs
9
State
Notes
User
Tags
Created
Runtime
Sweep
cur_lr
env step
episode return max
episode return mean
episode return min
episode return std
q_value
target_q_value
total_loss
train iter
Finished
-
zjowowen
12h 38m 7s
-
0.0001
19984716
8025
8025
8025
0
7.25182
7.30404
1.79377
1928010
Finished
-
zjowowen
11h 52m 5s
-
0.0001
19982394
8800
8800
8800
0
7.30814
7.33372
1.7497
1928010
Crashed
-
zjowowen
8h 18m 10s
-
0.0001
13097724
4550
4550
4550
0
7.17135
7.20432
1.75292
1264010
Crashed
-
zjowowen
4h 43m 3s
-
0.0001
6997506
4375
4375
4375
0
7.02061
7.11858
1.82554
676010
Finished
-
zjowowen
11h 44m 11s
-
0.0001
19980275
575
575
575
0
7.24365
7.25962
1.81162
1928010
Failed
-
zjowowen
9h 10m 19s
-
0.0001
19982786
0
12775
0
0
6.60279
6.66989
1.91959
1928010
Finished
-
zjowowen
14h 9m 29s
-
0.0001
19968204
-
4450
-
-
6.91224
6.97457
1.64894
1992010
Finished
-
zjowowen
14h 33m 20s
-
0.0001
19969397
-
7850
-
-
7.41574
7.37155
1.68515
1992010
Failed
-
zjowowen
14h 18m 56s
-
0.0001
19969081
-
7750
-
-
7.25728
7.25442
1.69669
1992010
1-9
of 9