Zjowowen's workspace
Runs: 1
Name:
State: Finished
Notes: -
User: zjowowen
Tags:
Created:
Runtime: 42m 45s
Sweep: -
alpha: 0.0078033
alpha_loss: 0.61742
critic_loss: 1.08194
cur_lr_p: 0.0003
cur_lr_q: 0.0003
env step: 199829
episode return max: 311.45129
episode return mean: 253.93079
episode return min: 25.08729
episode return std: 114.42188
policy_loss: -26.76137
target_q_value: 26.76925
td_error: 1.08149
train iter: 188480
transformed_log_prob: 0.12718
twin_critic_loss: 1.08103