Zjowowen's workspace
Runs: 2

| Field | Run 1 | Run 2 |
| --- | --- | --- |
| State | Finished | Finished |
| Notes | - | - |
| User | zjowowen | zjowowen |
| Tags | | |
| Created | | |
| Runtime | 2h 26m 10s | 2h 27m 16s |
| Sweep | - | - |
| action | -0.16343 | -0.16343 |
| actor_loss | -1100.82605 | -1100.82605 |
| critic_loss | 44.80353 | 44.80353 |
| cur_lr_actor | 0.001 | 0.001 |
| cur_lr_critic | 0.001 | 0.001 |
| env step | 675000 | 675000 |
| episode return max | 11152.92383 | 11152.92383 |
| episode return mean | 11073.76953 | 11073.76953 |
| episode return min | 10957.11816 | 10957.11816 |
| episode return std | 58.74898 | 58.74898 |
| q_value | 1098.17456 | 1098.17456 |
| td_error | 44.80353 | 44.80353 |
| total_loss | -1056.02246 | -1056.02246 |
| train iter | 650001 | 650001 |