Zjowowen's workspace
Runs
1
Name
1 visualized
State
Notes
User
Tags
Created
Runtime
Sweep
alpha
critic_loss
cur_lr_p
cur_lr_q
env step
episode return max
episode return mean
episode return min
episode return std
policy_loss
target_q_value
td_error
train iter
transformed_log_prob
twin_critic_loss
Finished
-
zjowowen
1d 15h 47s
-
0.2
740.29242
0.001
0.001
5000000
4796.23926
4781.27148
4731.92285
19.2376
-408.38293
408.39368
738.67804
4990001
1.17905
737.06354
1-1
of 1