
MuJoCo CleanRL PPO vs OpenAI/Baselines PPO

Created on September 10 | Last edited on September 10

[Figure: Episodic Return vs. Steps (x-axis 2M to 8M, y-axis 0 to 6000)]
[Figure: Episodic Return vs. Time in minutes (x-axis 1 to 5, y-axis 0 to 6000)]
Run sets: CleanRL PPO + Envpool (5 runs), jaxrl's SAC (10 runs)
Run table (1 of 1 rows shown, 5 runs visualized)

Run metadata
State: Finished
User: costa-huang
Runtime: 58m 38s

Config
anneal_lr: true
batch_size: 4096
capture_video: false
clip_coef: 0.2
clip_vloss: false
cuda: true
ent_coef: 0
env_id: HalfCheetah-v4
exp_name: ppo_continuous_action_envpool
gae: true
gae_lambda: 0.95
gamma: 0.99
learning_rate: 0.00295
max_grad_norm: 3.5
minibatch_size: 1024
norm_adv: true
num_envs: 64
num_minibatches: 4
num_steps: 64
seed: 3
torch_deterministic: true
total_timesteps: 10000000
track: true
update_epochs: 2
vf_coef: 1.3
wandb_project_name: envpool-cleanrl

Summary metrics
charts/SPS: 29617.4
charts/avg_episodic_return: 2584.12859
charts/episodic_length: 1001
charts/episodic_return: 2037.39253
charts/learning_rate: 0.0000012085
global_step: 9998336
losses/approx_kl: 0.0000039901
losses/clipfrac: 0
losses/entropy: -5.84674
losses/explained_variance: 0.87358

Columns shown as "-" for this run: Notes, Sweep, clip-vloss, ent-coef, learning-rate, max-grad-norm, num-minibatches, num-steps, update-epochs, vf-coef
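
Several values in the run table are not independent settings but follow from others. The sketch below recomputes them from the logged config as a sanity check. It assumes the usual PPO bookkeeping (one rollout of num_envs * num_steps transitions per update) and a linear learning-rate decay over updates when anneal_lr is true. The helper name annealed_lr is hypothetical and used only for illustration; this is not the script that produced the run.

```python
# Values copied from the run table; everything below is derived from them.
num_envs = 64
num_steps = 64
num_minibatches = 4
total_timesteps = 10_000_000
learning_rate = 0.00295

# Assumption: one update consumes one rollout from every environment.
batch_size = num_envs * num_steps
minibatch_size = batch_size // num_minibatches

# Only whole rollouts are run, so the final step count can fall
# slightly short of total_timesteps.
num_updates = total_timesteps // batch_size
final_global_step = num_updates * batch_size


def annealed_lr(update: int) -> float:
    """Hypothetical helper: linear decay, with `update` counted from 1.

    Assumption: the rate starts at learning_rate on the first update and
    shrinks by learning_rate / num_updates on each later update.
    """
    frac = 1.0 - (update - 1.0) / num_updates
    return frac * learning_rate


final_lr = annealed_lr(num_updates)

print(batch_size, minibatch_size, num_updates, final_global_step)
print(f"{final_lr:.4e}")
```

Under these assumptions the derived values are batch_size 4096, minibatch_size 1024, 2441 updates, a final step of 9998336, and a final learning rate of about 1.2085e-06, which agree with the batch_size, minibatch_size, global_step, and charts/learning_rate entries logged for this run.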