Prabhasak's group workspace
Group: Pendulum-v0
Name
6 visualized
State
Notes
User
Tags
Created
Runtime
Sweep
algo
batch_size
buffer_size
cg_damping
cg_iters
check_callback
cliprange
device
ent_coef
entcoeff
env
eval_callback
exp_id
gamma
gradient_steps
lam
learning_rate
learning_starts
max_kl
n_steps
n_timesteps
nminibatches
noptepochs
num_trajs
policy
save_best_model
seed
tensorboard
timesteps_IL
timesteps_RL
timesteps_per_batch
train_IL
train_RL
train_freq
traj_use
verbose
vf_iters
vf_stepsize
wandb_log
BC_max_iter
checkpoint_dir
env_id
env_kwargs.expert
env_kwargs.name
Finished
-
prabhasak
6s
-
sac
128
-
-
-
-
-
gpu
-
-
-
-
2
-
-
-
-
-
-
-
-
-
-
20
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
100000
models
Pendulum-v0
-
-
Finished
-
prabhasak
6s
-
sac
128
-
-
-
-
-
gpu
-
-
-
-
2
-
-
-
-
-
-
-
-
-
-
10
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
100000
models
Pendulum-v0
-
-
Finished
-
prabhasak
7s
-
sac
128
-
-
-
-
-
gpu
-
-
-
-
2
-
-
-
-
-
-
-
-
-
-
5
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
100000
models
Pendulum-v0
-
-
Finished
-
prabhasak
7m 28s
-
sac
-
-
-
-
-
-
gpu
-
-
-
-
1
-
-
-
-
-
-
-
-
-
-
-
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
1e5
models
Pendulum-v0
-
-
Finished
-
prabhasak
51m 37s
-
sac
-
-
0.0000235
-
false
-
gpu
-
0.01118
Pendulum-v0
true
1
0.99
-
0.9
-
-
0.000193
-
-
-
-
20
MlpPolicy
true
42
true
0
0
1024
true
false
-
-
0
10
0.00428
true
-
-
-
-
-
Finished
-
prabhasak
5m 36s
-
sac
-
-
-
-
-
-
gpu
-
-
-
-
1
-
-
-
-
-
-
-
-
-
-
-
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
1e5
models
Pendulum-v0
-
-
Finished
-
prabhasak
45m 22s
-
sac
-
-
0.0000235
-
false
-
gpu
-
0.01118
Pendulum-v0
true
1
0.99
-
0.9
-
-
0.000193
-
-
-
-
10
MlpPolicy
true
42
true
0
0
1024
true
false
-
-
0
10
0.00428
true
-
-
-
-
-
Finished
-
prabhasak
9m 13s
-
sac
-
-
-
-
-
-
gpu
-
-
-
-
1
-
-
-
-
-
-
-
-
-
-
-
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
1e5
models
Pendulum-v0
-
-
Finished
-
prabhasak
1h 5m 57s
-
sac
-
-
0.0000235
-
false
-
gpu
-
0.01118
Pendulum-v0
true
1
0.99
-
0.9
-
-
0.000193
-
-
-
-
5
MlpPolicy
true
42
true
0
0
1024
true
false
-
-
0
10
0.00428
true
-
-
-
-
-
Finished
-
prabhasak
12m 12s
-
td3
-
-
-
-
false
-
gpu
-
-
Pendulum-v0
true
1
-
-
-
-
1000
-
-
-
-
-
10
MlpPolicy
true
42
true
0
0
-
false
true
-
-
0
-
-
true
-
-
-
-
-
Finished
-
prabhasak
15m 35s
-
sac
-
-
-
-
false
-
gpu
-
-
Pendulum-v0
true
1
-
-
-
-
1000
-
-
-
-
-
10
MlpPolicy
true
42
true
0
1e5
-
false
true
-
-
0
-
-
true
-
-
-
-
-
1-11
of 11