Skip to main content
costa-huang
Projects
cleanRL
Reports
Deepmind Mujoco vs openai/mujoco_py
Log in
Sign up
Share
Comment
1 star
Share
Comment
1 star
Deepmind Mujoco vs openai/mujoco_py
Costa Huang
Created on March 27
|
Last edited on March 27
Comment
CleanRL's ppo_continuous_action.py
CleanRL's ppo_continuous_action.py
Select runs that logged charts/episodic_return
to visualize data in this line chart.
Run set
3
Run set 2
87
Name
87 visualized
env_id: Hopper-v4
env_id: Hopper-v4
87
01-10 of 87
State
Notes
User
Tags
Created
Runtime
Sweep
actor_device_ids
actor_devices
adv_norm_fullbatch
alpha
anneal_lr
anneal_steps
async_batch_size
async_update
autotune
aux_batch_rollouts
backend
base_model
batch_size
beta_clone
buffer_size
capture_video
channels
clip_coef
clip_vloss
concurrency
cuda
debug_normalize
deepspeed
device_ids
discount
distill_batch_size
distill_beta
distill_learning_rate
distill_update_epochs
distributed
dropout_rate
e_auxiliary
e_policy
end_e
ent_coef
env
env_id
eps
eval_every
eval_freq
exp_name
expl_noise
exploration_fraction
exploration_noise
Crashed
-
costa-huang
3y ago
11mo 22d 23h 22m 33s
-
-
-
-
0.2
true
-
-
-
true
-
-
-
1677.24138
-
1000000
[false,true]
-
0.2
true
-
[false,true]
-
-
-
-
-
-
-
-
-
-
-
-
-
0
-
Hopper-v4
-
-
-
["ddpg_continuous_action","ddpg_continuous_action_jax","ppo_continuous_action","ppo_continuous_action_envpool_xla_jax_scan","rpo_continuous_action_alpha_0_01","rpo_continuous_action_alpha_0_05","rpo_continuous_action_alpha_0_1","sac_continuous_action","td3_continuous_action","td3_continuous_action_jax"]
-
-
0.1
1-1
of 1
Run set
3
Run set 2
107
Run set
3
Run set 2
87
Add a comment