Zjowowen's workspace
Runs
9
State
Notes
User
Tags
Created
Runtime
Sweep
cur_lr
env step
episode return max
episode return mean
episode return min
episode return std
q_value
target_q_value
total_loss
train iter
Finished
-
zjowowen
3h 14m 25s
-
0.001
3994472
260.54919
133.94839
-56.09517
134.71391
12.65057
12.34687
2.9427
618010
Finished
-
zjowowen
3h 3m 20s
-
0.001
3994256
249.90398
43.34831
-84.82804
127.59538
10.2887
10.23084
3.07009
617010
Finished
-
zjowowen
2h 51m 14s
-
0.001
3997275
236.38025
16.04584
-99.58553
123.38306
11.88602
11.87957
2.88886
618010
Finished
-
zjowowen
2h 52m 19s
-
0.001
3994737
165.91797
-62.0942
-200.76927
105.33772
7.11718
7.09567
3.10749
618010
Finished
-
zjowowen
2h 50m 26s
-
0.001
3994695
229.62076
105.6329
-67.02738
110.53143
12.41713
12.29926
3.24595
618010
Finished
-
zjowowen
2h 54m 6s
-
0.001
3996624
247.13239
10.33449
-339.1915
177.85355
12.64169
12.74121
2.98057
618010
Finished
-
zjowowen
2h 49m 29s
-
0.001
3997681
245.11937
110.01527
-86.74774
133.28079
13.0535
13.03076
2.94308
618010
Failed
-
zjowowen
19m 38s
-
0.001
446551
39.31585
219.0278
39.31585
39.31585
11.2545
10.6579
3.01696
69010
Failed
-
zjowowen
52m 45s
-
0.001
956881
-
248.1199
-
-
11.89864
12.16233
2.98176
148010
1-9
of 9