Prabhasak's group workspace
Group: LunarLanderContinuous-v2
Name
6 visualized
State
Notes
User
Tags
Created
Runtime
Sweep
algo
batch_size
buffer_size
cg_damping
cg_iters
check_callback
cliprange
device
ent_coef
entcoeff
env
eval_callback
exp_id
gamma
gradient_steps
lam
learning_rate
learning_starts
max_kl
n_steps
n_timesteps
nminibatches
noptepochs
num_trajs
policy
save_best_model
seed
tensorboard
timesteps_IL
timesteps_RL
timesteps_per_batch
train_IL
train_RL
train_freq
traj_use
verbose
vf_iters
vf_stepsize
wandb_log
BC_max_iter
checkpoint_dir
env_id
env_kwargs.expert
env_kwargs.name
Finished
-
prabhasak
5s
-
sac
128
-
-
-
-
-
gpu
-
-
-
-
2
-
-
-
-
-
-
-
-
-
-
20
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
100000
models
LunarLanderContinuous-v2
-
-
Finished
-
prabhasak
6s
-
sac
128
-
-
-
-
-
gpu
-
-
-
-
2
-
-
-
-
-
-
-
-
-
-
10
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
100000
models
LunarLanderContinuous-v2
-
-
Finished
-
prabhasak
6s
-
sac
128
-
-
-
-
-
gpu
-
-
-
-
2
-
-
-
-
-
-
-
-
-
-
5
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
100000
models
LunarLanderContinuous-v2
-
-
Finished
-
prabhasak
12h 28m 33s
-
sac
-
-
0.1
10
false
-
gpu
-
0
LunarLanderContinuous-v2
true
1
0.995
-
0.98
-
-
0.01
-
-
-
-
20
MlpPolicy
true
42
true
2e6
0
1024
true
false
-
-
0
5
0.001
true
-
-
-
-
-
Finished
-
prabhasak
5h 18m 7s
-
sac
-
-
0.1
10
false
-
gpu
-
0
LunarLanderContinuous-v2
true
1
0.995
-
0.98
-
-
0.01
-
-
-
-
20
MlpPolicy
true
42
true
0
0
1024
true
false
-
-
0
5
0.001
true
-
-
-
-
-
Finished
-
prabhasak
6h 21m
-
sac
-
-
0.1
10
false
-
gpu
-
0
LunarLanderContinuous-v2
true
1
0.995
-
0.98
-
-
0.01
-
-
-
-
10
MlpPolicy
true
42
true
0
0
1024
true
false
-
-
0
5
0.001
true
-
-
-
-
-
Finished
-
prabhasak
6h 47m 53s
-
sac
-
-
0.1
10
false
-
gpu
-
0
LunarLanderContinuous-v2
true
1
0.995
-
0.98
-
-
0.01
-
-
-
-
5
MlpPolicy
true
42
true
0
0
1024
true
false
-
-
0
5
0.001
true
-
-
-
-
-
Finished
-
prabhasak
7m 12s
-
sac
256
-
-
-
-
-
gpu
-
-
-
-
1
-
-
-
-
-
-
-
-
-
-
20
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
1e5
models
LunarLanderContinuous-v2
-
-
Finished
-
prabhasak
3h 51m 23s
-
sac
-
-
0.1
10
false
-
gpu
-
0
LunarLanderContinuous-v2
true
1
0.99
-
0.98
-
-
0.01
-
-
-
-
20
MlpPolicy
true
42
true
0
0
1024
true
false
-
-
0
5
0.001
true
-
-
-
-
-
Finished
-
prabhasak
23m 4s
-
sac
256
-
-
-
-
-
gpu
-
-
-
-
1
-
-
-
-
-
-
-
-
-
-
10
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
3e5
models
LunarLanderContinuous-v2
-
-
Finished
-
prabhasak
5h 13m
-
sac
-
-
0.1
10
false
-
gpu
-
0
LunarLanderContinuous-v2
true
1
0.99
-
0.98
-
-
0.01
-
-
-
-
10
MlpPolicy
true
42
true
0
0
1024
true
false
-
-
0
5
0.001
true
-
-
-
-
-
Finished
-
prabhasak
1h 27m 48s
-
sac
256
-
-
-
-
-
gpu
-
-
-
-
1
-
-
-
-
-
-
-
-
-
-
5
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
1e6
models
LunarLanderContinuous-v2
-
-
Finished
-
prabhasak
5h 51m 32s
-
sac
-
-
0.1
10
false
-
gpu
-
0
LunarLanderContinuous-v2
true
1
0.99
-
0.98
-
-
0.01
-
-
-
-
5
MlpPolicy
true
42
true
0
0
1024
true
false
-
-
0
5
0.001
true
-
-
-
-
-
Finished
-
prabhasak
5h 7m 54s
-
sac
-
-
0.1
10
false
-
gpu
-
0
LunarLanderContinuous-v2
true
1
0.99
-
0.98
-
-
0.01
-
-
-
-
5
MlpPolicy
true
42
true
0
0
1024
true
false
-
-
0
5
0.001
true
-
-
-
-
-
Finished
-
prabhasak
56m 56s
-
td3
256
-
-
-
false
-
gpu
-
-
LunarLanderContinuous-v2
true
1
-
-
-
-
1000
-
-
-
-
-
10
MlpPolicy
true
42
true
0
0
-
false
true
-
-
0
-
-
true
-
-
-
-
-
Finished
-
prabhasak
2h 18m 37s
-
sac
256
-
-
-
false
-
gpu
-
-
LunarLanderContinuous-v2
true
1
-
-
-
-
1000
-
-
-
-
-
10
MlpPolicy
true
42
true
0
0
-
false
true
-
-
0
-
-
true
-
-
-
-
-
1-16
of 16