Section 1
charts/episode_reward
Run set
6
6 visualized

Runs 1-10 of 342 (fields as rows, runs as columns; "-" means no value recorded):

| Field | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| State | Finished | Finished | Failed | Finished | Killed | Crashed | Finished | Finished | Finished | Finished |
| User | bragajj | costa-huang | costa-huang | costa-huang | costa-huang | dosssman | dosssman | costa-huang | costa-huang | costa-huang |
| Runtime | 4s | 6h 23m 30s | 13s | 38m 47s | 11m 46s | 59s | 11h 29s | 14s | 53s | 55s |
| alg | - | ppo2 | ppo2 | ppo2 | ppo2 | - | - | - | - | - |
| anneal_lr | true | - | - | - | - | true | true | - | true | true |
| batch_size | 512 | - | - | - | - | 1024 | 1024 | - | 512 | 512 |
| built_in_ais | - | - | - | - | - | - | - | ["randomBiasedAI","workerRushAI","lightRushAI","coacAI"] | - | - |
| capture_video | false | - | - | - | - | false | false | - | true | true |
| clip_coef | 0.2 | - | - | - | - | 0.1 | 0.1 | - | 0.2 | 0.2 |
| clip_vloss | true | - | - | - | - | true | true | - | true | true |
| cuda | true | - | - | - | - | true | true | - | true | true |
| ent_coef | 0.01 | - | - | - | - | 0.01 | 0.01 | - | 0.01 | 0.01 |
| env | - | BreakoutNoFrameskip-v4 | BreakoutNoFrameskip-v4 | BreakoutNoFrameskip-v4 | BreakoutNoFrameskip-v4 | - | - | - | - | - |
| env_id | CartPole-v0 | - | - | - | - | - | - | - | - | - |
| exp_name | ppo | - | - | - | - | ppo_atari_lstm | ppo_atari_lstm_noframestack | league | ppo | ppo |
| gae | true | - | - | - | - | true | true | - | true | true |
| gae_lambda | 0.95 | - | - | - | - | 0.95 | 0.95 | - | 0.95 | 0.95 |
| gamma | 0.99 | - | - | - | - | 0.99 | 0.99 | - | 0.99 | 0.99 |
| gym_id | - | - | - | - | - | BreakoutNoFrameskip-v4 | BreakoutNoFrameskip-v4 | - | CartPole-v0 | CartPole-v0 |
| kle_rollback | - | - | - | - | - | - | - | - | false | false |
| kle_stop | - | - | - | - | - | - | - | - | false | false |
| learning_rate | 0.00025 | - | - | - | - | 0.00025 | 0.00025 | - | 0.0003 | 0.0003 |

Fields with no value for any of these ten runs: Notes, Sweep, action_noise, actor_buffer_size, alpha, aux_batch_size, aux_minibatch_size, beta_clone, buffer_size, e_auxiliary, e_policy, end_e, end_sigma, episode_length, eta, eval_frequency, eval_levels, exploration_fraction, ext_coef, features_turned_on, griddly_level, griddly_max_steps, hidden_sizes, int_coef, int_gamma, kl, learning_starts.

The Name, Tags, and Created columns carry no values in this capture.
Section 2
Run set
14
Section 3
Run set
10
Created with ❤️ on Weights & Biases.
https://wandb.ai/cleanrl/cleanRL/reports/PPO-reward-normalization-technique-comparison--Vmlldzo0NzkxMA