Zjowowen's workspace
Runs
4
State
Notes
User
Tags
Created
Runtime
Sweep
alpha
alpha_loss
critic_loss
cur_lr_p
cur_lr_q
env step
episode return max
episode return mean
episode return min
episode return std
policy_loss
target_q_value
td_error
train iter
transformed_log_prob
twin_critic_loss
Finished
-
zjowowen
1m 42s
-
0.16501
0.62528
0.43125
0.001
0.001
48999
-3.09953
-238.01773
-853.45752
260.74295
73.46294
-69.72065
0.39718
3001
0.347
0.36311
Finished
-
zjowowen
40m 24s
-
0.14681
-0.39501
5.63941
0.001
0.001
103398
-
-245.36868
-
-
123.38284
-120.09315
4.60845
6401
-0.20589
3.5775
Failed
-
zjowowen
2m 34s
-
0.15942
-0.082233
2.81722
0.001
0.001
101800
-
-229.84497
-
-
107.30708
-104.34152
2.15438
6301
-0.044783
1.49154
Finished
-
zjowowen
20m 24s
-
0.15942
-0.082233
2.81722
0.001
0.001
101800
-
-229.84497
-
-
107.30708
-104.34152
2.15438
6301
-0.044783
1.49154
1-4
of 4