Comment
StarPilot
StarPilot
Run set
6
Name
6 visualized
gym_id: starpilot
gym_id: starpilot
2
6
State
Notes
User
Tags
Created
Runtime
Sweep
alg
env
exp_name
network
num_env
num_timesteps
play
reward_scale
save_video_interval
save_video_length
seed
track
anneal_lr
batch_size
capture_video
clip_coef
clip_vloss
cuda
ent_coef
gae
gae_lambda
gamma
gym_id
learning_rate
max_grad_norm
minibatch_size
norm_adv
num_envs
num_minibatches
num_steps
torch_deterministic
total_timesteps
update_epochs
vf_coef
wandb_entity
wandb_project_name
aux_batch_size
aux_minibatch_size
beta_clone
e_auxiliary
e_policy
n_aux_grad_accum
n_aux_minibatch
n_iteration
Finished
costa-huang
2d 6h 49m 32s
-
ppo2
starpilot
["baselines-ppo2-None","ppo_procgen"]
-
64
25000000
false
1
0
200
2
true
false
16384
false
0.2
true
true
0.01
true
0.95
0.999
starpilot
0.0005
0.5
2048
true
64
8
256
true
25000000
3
0.5
vwxyzjn
ppo-details
-
-
-
-
-
-
-
-
1-1
of 1
Run set
6
Run set
6
Add a comment
Created with ❤️ on Weights & Biases.
https://wandb.ai/vwxyzjn/ppo-details/reports/Procgen-Our-PPO-vs-openai-baselines-PPO--VmlldzoxNTAxOTY0