
Repeat

Created on July 19 | Last edited on July 19

[Plot: x-axis global_step (ticks 500 to 1.5k), y-axis ticks 0.2 to 1; legend exp_name: train_policy_accelerate_repeat2]
[Plot: x-axis global_step (ticks 500 to 1.5k), y-axis ticks 0 to 20; legend exp_name: train_policy_accelerate_repeat2]
Run set (8)
Run set 2 (10)
Name
State
Notes
User
Tags
Created
Runtime
Sweep
_name_or_path
a
accelerator_config.even_batches
accelerator_config.split_batches
accelerator_config.use_seedable_sampler
action_noise
action_shape
actor_buffer_size
actor_device_ids
actor_devices
adafactor
adam_beta1
adam_beta2
adam_epsilon
add_cross_attention
agent_model_path
alg
algorithm
algorithm_spec.GAE
algorithm_spec.K_epoch
algorithm_spec.dueling
algorithm_spec.entropy_coeff
algorithm_spec.episodic_update
algorithm_spec.eps_clip
algorithm_spec.eps_decay
algorithm_spec.eps_final
algorithm_spec.eps_start
algorithm_spec.gamma
algorithm_spec.lambda
algorithm_spec.max_grad_norm
algorithm_spec.multi_step
algorithm_spec.policy_loss_coeff
algorithm_spec.replay_buffer_size
algorithm_spec.target_update_interval
algorithm_spec.vf_coeff
alpha
anneal-lr
anneal_lr
architectures
async_batch_size
async_update
asyncvec
attention_bias
attention_dropout
State: Crashed
Notes: -
User: costa-huang
Runtime: 2h 46m 16s
Sweep through attention_dropout (all remaining columns): -