Comment
BreakoutNoFrameskip-v4
BreakoutNoFrameskip-v4
Run set
6
Name
6 visualized
gym_id: BreakoutNoFrameskip-v4
gym_id: BreakoutNoFrameskip-v4
4
12
State
Notes
User
Tags
Created
Runtime
Sweep
alg
env
exp_name
network
num_env
num_timesteps
play
reward_scale
save_video_interval
save_video_length
seed
track
anneal_lr
batch_size
capture_video
clip_coef
clip_vloss
cuda
ent_coef
gae
gae_lambda
gamma
gym_id
learning_rate
max_grad_norm
minibatch_size
norm_adv
num_envs
num_minibatches
num_steps
torch_deterministic
total_timesteps
update_epochs
vf_coef
wandb_entity
wandb_project_name
aux_batch_size
aux_minibatch_size
beta_clone
e_auxiliary
e_policy
n_aux_grad_accum
n_aux_minibatch
n_iteration
Finished
-
costa-huang
4d 26m 48s
-
ppo2
BreakoutNoFrameskip-v4
["baselines-ppo2-cnn","baselines-ppo2-no-framestack-cnn_lstm","ppo_atari","ppo_atari_lstm"]
["cnn","cnn_lstm"]
8
10000000
false
1
0
200
2
true
true
1024
false
0.1
true
true
0.01
true
0.95
0.99
BreakoutNoFrameskip-v4
0.00025
0.5
256
true
8
4
128
true
10000000
4
0.5
vwxyzjn
ppo-details
-
-
-
-
-
-
-
-
1-1
of 1
Run set
6
Run set
6
Add a comment
Created with ❤️ on Weights & Biases.
https://wandb.ai/vwxyzjn/ppo-details/reports/Atari-Our-PPO-vs-openai-baselines-PPO--VmlldzoxODAxMzI0