Prabhasak's group workspace
Group: HalfCheetah-v2
State
Notes
User
Tags
Created
Runtime
Sweep
algo
batch_size
buffer_size
cg_damping
cg_iters
check_callback
cliprange
device
ent_coef
entcoeff
env
eval_callback
exp_id
gamma
gradient_steps
lam
learning_rate
learning_starts
max_kl
n_steps
n_timesteps
nminibatches
noptepochs
num_trajs
policy
save_best_model
seed
tensorboard
timesteps_IL
timesteps_RL
timesteps_per_batch
train_IL
train_RL
train_freq
traj_use
verbose
vf_iters
vf_stepsize
wandb_log
BC_max_iter
checkpoint_dir
env_id
env_kwargs.expert
env_kwargs.name
Finished
-
prabhasak
56m 53s
-
sac
256
-
-
-
-
-
gpu
-
-
-
-
2
-
-
-
-
-
-
-
-
-
-
10
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
5e5
models
HalfCheetah-v2
-
-
Finished
-
prabhasak
39m 25s
-
sac
256
-
-
-
-
-
gpu
-
-
-
-
2
-
-
-
-
-
-
-
-
-
-
5
-
-
42
-
-
-
-
-
-
-
-
0
-
-
true
5e5
models
HalfCheetah-v2
-
-
Finished
-
prabhasak
5h 53m 32s
-
sac
-
-
0.1
15
false
-
gpu
-
0
HalfCheetah-v2
true
2
0.99
-
0.95
-
-
0.01
-
-
-
-
5
MlpPolicy
true
42
true
0
0
2048
true
false
-
-
0
5
0.001
true
-
-
-
-
-
Finished
-
prabhasak
7h 27m 59s
-
sac
256
1000000
-
-
false
-
gpu
auto
-
HalfCheetah-v2
true
2
0.99
1
-
0.0003
10000
-
-
-
-
-
10
CustomSACPolicy
true
42
true
0
0
-
false
true
1
-
0
-
-
true
-
-
-
-
-
Finished
-
prabhasak
12h 16m 43s
-
sac
256
1000000
-
-
false
-
gpu
auto
-
HalfCheetah-v2
true
1
0.99
1
-
0.0003
10000
-
-
-
-
-
10
CustomSACPolicy
true
42
true
0
0
-
false
true
1
-
0
-
-
true
-
-
-
-
-
Finished
-
prabhasak
1h 25m 46s
-
trpo
-
-
0.1
15
false
-
gpu
-
0
HalfCheetah-v2
true
1
0.99
-
0.95
-
-
0.01
-
-
-
-
10
MlpPolicy
true
42
true
0
0
2048
false
true
-
-
0
5
0.001
true
-
-
-
-
-
1-6
of 6