Zjowowen's workspace
Runs (3)
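| State | Notes | User | Tags | Created | Runtime | Sweep | cur_lr | env step | episode return max | episode return mean | episode return min | episode return std | q_value | target_q_value | total_loss | train iter |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Finished | - | zjowowen | | | 7h 12m 53s | - | 0.0001 | 7462344 | 2100 | 2100 | 2100 | 0 | 5.5384 | 5.54822 | 0.12448 | 719010 |
| Finished | - | zjowowen | | | 4h 55m 15s | - | 0.0001 | 9796613 | 0 | 2060 | 0 | 0 | 5.85005 | - | 0.11972 | 944010 |
| Failed | - | zjowowen | | | 3h 49m 15s | - | 0.0001 | 8010699 | 0 | 2275 | 0 | 0 | 6.27568 | - | 0.13873 | 772010 |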