Prabhasak's group workspace
Group: CartPole-v1
State
Notes
User
Tags
Created
Runtime
Sweep
algo
batch_size
buffer_size
cg_damping
cg_iters
check_callback
cliprange
device
ent_coef
entcoeff
env
eval_callback
exp_id
gamma
gradient_steps
lam
learning_rate
learning_starts
max_kl
n_steps
n_timesteps
nminibatches
noptepochs
num_trajs
policy
save_best_model
seed
tensorboard
timesteps_IL
timesteps_RL
timesteps_per_batch
train_IL
train_RL
train_freq
traj_use
verbose
vf_iters
vf_stepsize
wandb_log
BC_max_iter
checkpoint_dir
env_id
env_kwargs.expert
env_kwargs.name
Finished
prabhasak
3h 49m 26s
-
dqn
-
-
0.001
10
false
-
gpu
-
0
CartPole-v1
true
1
0.99
-
1
-
-
0.001
-
-
-
-
5
MlpPolicy
true
42
true
0
0
512
true
false
-
-
0
3
0.0001
true
-
-
-
-
-
Finished
prabhasak
1h 58m 20s
-
dqn
-
-
0.001
10
false
-
gpu
-
0
CartPole-v1
true
1
0.99
-
1
-
-
0.001
-
-
-
-
5
MlpPolicy
true
42
true
5e5
0
512
true
false
-
-
0
3
0.0001
true
-
-
-
-
-
Finished
prabhasak
12m 17s
-
ppo2
-
-
-
-
false
lin_0.2
gpu
0
-
CartPole-v1
true
1
0.98
-
0.8
lin_0.001
-
-
32
-
1
20
10
MlpPolicy
true
42
true
0
0
-
false
true
-
-
0
-
-
true
-
-
-
-
-
Finished
prabhasak
21m 5s
-
dqn
-
50000
-
-
false
-
gpu
-
-
CartPole-v1
true
1
-
-
-
0.001
-
-
-
-
-
-
10
CustomDQNPolicy
true
42
true
0
0
-
false
true
-
-
0
-
-
true
-
-
-
-
-
1-4
of 4