Gaoruoyu751's workspace
Runs (3)
| State | Notes | User | Tags | Created | Runtime | Sweep | cur_lr | env step | episode return mean | q_value | total_loss | train iter |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Failed | - | gaoruoyu751 | | | 4d 20h 26m 49s | - | 0.0001 | 129164398 | 2616 | 7.55008 | 0.16008 | 12432010 |
| Crashed | - | gaoruoyu751 | | | 18h 59m 35s | - | 0.0001 | - | - | 16.62677 | 0.82869 | - |
| Failed | - | gaoruoyu751 | | | 5h 14m 1s | - | 0.0001 | 7541436 | 48 | 14.71839 | 0.48517 | 729010 |