Zjowowen's workspace
Runs
8
State
Notes
User
Tags
Created
Runtime
Sweep
alpha
alpha_loss
critic_loss
cur_lr_p
cur_lr_q
env step
episode return max
episode return mean
episode return min
episode return std
policy_loss
target_q_value
td_error
train iter
transformed_log_prob
twin_critic_loss
Finished
-
zjowowen
2h 14m 54s
-
0.032466
0.05905
15.03209
0.0003
0.001
561202
292.95325
271.75772
247.09859
15.53841
-25.25175
25.36171
15.01405
546048
0.01716
14.996
Finished
-
zjowowen
51m 41s
-
0.031797
0.14519
10.65159
0.0003
0.001
137949
-
245.0903
-
-
-24.19338
24.17662
10.64689
127232
0.042087
10.64218
Finished
-
zjowowen
1h 16m 31s
-
0.017024
0.95568
11.35492
0.0003
0.001
259586
-
245.13167
-
-
-9.8318
9.69907
11.40612
248064
0.23477
11.45732
Crashed
-
zjowowen
42m 2s
-
0.012237
-0.41956
4.28343
0.0003
0.001
-
-
-
-
-
-1.60407
1.62675
4.40731
-
-0.095139
4.53118
Finished
-
zjowowen
46m 23s
-
0.020727
-0.87643
9.06894
0.0003
0.001
167822
-
263.19696
-
-
-1.9191
1.90044
9.06883
156928
-0.22596
9.06872
Failed
-
zjowowen
59m 19s
-
0.033426
-0.22308
6.24685
0.0003
0.001
229638
-
247.87173
-
-
-2.23546
2.18573
6.14069
218368
-0.06566
6.03453
Finished
-
zjowowen
37m 6s
-
0.015406
0.30497
2.71449
0.0003
0.001
147087
-
259.39627
-
-
0.25546
-0.30784
2.72988
136448
0.073049
2.74528
Failed
-
zjowowen
43m 30s
-
0.025077
0.40297
11.60069
0.0003
0.001
183448
-
240.54596
-
-
-10.77445
10.75815
11.53393
172288
0.1095
11.46717
1-8
of 8