Zhaochenyang20's workspace
Runs
4
State
Notes
User
Tags
Created
Runtime
Sweep
actor_rollout_ref.actor._target_
actor_rollout_ref.actor.checkpoint._target_
actor_rollout_ref.actor.checkpoint.async_save
actor_rollout_ref.actor.checkpoint.load_contents
actor_rollout_ref.actor.checkpoint.save_contents
actor_rollout_ref.actor.clip_ratio
actor_rollout_ref.actor.clip_ratio_c
actor_rollout_ref.actor.clip_ratio_high
actor_rollout_ref.actor.clip_ratio_low
actor_rollout_ref.actor.entropy_checkpointing
actor_rollout_ref.actor.entropy_coeff
actor_rollout_ref.actor.entropy_from_logits_with_chunking
actor_rollout_ref.actor.fsdp_config._target_
actor_rollout_ref.actor.fsdp_config.forward_prefetch
actor_rollout_ref.actor.fsdp_config.fsdp_size
actor_rollout_ref.actor.fsdp_config.offload_policy
actor_rollout_ref.actor.fsdp_config.optimizer_offload
actor_rollout_ref.actor.fsdp_config.param_offload
actor_rollout_ref.actor.fsdp_config.reshard_after_forward
actor_rollout_ref.actor.fsdp_config.wrap_policy.min_num_params
actor_rollout_ref.actor.grad_clip
actor_rollout_ref.actor.kl_loss_coef
actor_rollout_ref.actor.kl_loss_type
actor_rollout_ref.actor.loss_agg_mode
actor_rollout_ref.actor.optim._target_
actor_rollout_ref.actor.optim.lr
actor_rollout_ref.actor.optim.lr_warmup_steps
actor_rollout_ref.actor.optim.lr_warmup_steps_ratio
actor_rollout_ref.actor.optim.min_lr_ratio
actor_rollout_ref.actor.optim.num_cycles
actor_rollout_ref.actor.optim.total_training_steps
actor_rollout_ref.actor.optim.warmup_style
actor_rollout_ref.actor.optim.weight_decay
actor_rollout_ref.actor.policy_loss._target_
actor_rollout_ref.actor.policy_loss.clip_cov_lb
actor_rollout_ref.actor.policy_loss.clip_cov_ratio
actor_rollout_ref.actor.policy_loss.clip_cov_ub
actor_rollout_ref.actor.policy_loss.kl_cov_ratio
actor_rollout_ref.actor.policy_loss.loss_mode
actor_rollout_ref.actor.policy_loss.ppo_kl_coef
actor_rollout_ref.actor.ppo_epochs
actor_rollout_ref.actor.ppo_max_token_len_per_gpu
actor_rollout_ref.actor.ppo_micro_batch_size_per_gpu
actor_rollout_ref.actor.ppo_mini_batch_size
Crashed
Add notes...
zhaochenyang20
43m 33s
-
verl.workers.config.FSDPActorConfig
verl.trainer.config.CheckpointConfig
false
["model","optimizer","extra"]
["model","optimizer","extra"]
0.2
3
0.2
0.2
false
0
false
verl.workers.config.FSDPEngineConfig
false
-1
false
false
false
true
0
1
0.001
low_var_kl
token-mean
verl.workers.config.FSDPOptimizerConfig
0.000001
-1
0
0
0.5
435
constant
0.01
verl.workers.config.PolicyLossConfig
1
0.0002
5
0.0002
vanilla
0.1
1
16384
32
256
Crashed
Add notes...
zhaochenyang20
43m 31s
-
verl.workers.config.FSDPActorConfig
verl.trainer.config.CheckpointConfig
false
["model","optimizer","extra"]
["model","optimizer","extra"]
0.2
3
0.2
0.2
false
0
false
verl.workers.config.FSDPEngineConfig
false
-1
false
false
false
true
0
1
0.001
low_var_kl
token-mean
verl.workers.config.FSDPOptimizerConfig
0.000001
-1
0
0
0.5
435
constant
0.01
verl.workers.config.PolicyLossConfig
1
0.0002
5
0.0002
vanilla
0.1
1
16384
32
256
Crashed
Add notes...
zhaochenyang20
43m 16s
-
verl.workers.config.FSDPActorConfig
verl.trainer.config.CheckpointConfig
false
["model","optimizer","extra"]
["model","optimizer","extra"]
0.2
3
0.2
0.2
false
0
false
verl.workers.config.FSDPEngineConfig
false
-1
false
false
false
true
0
1
0.001
low_var_kl
token-mean
verl.workers.config.FSDPOptimizerConfig
0.000001
-1
0
0
0.5
435
constant
0.01
verl.workers.config.PolicyLossConfig
1
0.0002
5
0.0002
vanilla
0.1
1
16384
32
256
Crashed
Add notes...
zhaochenyang20
43m 15s
-
verl.workers.config.FSDPActorConfig
verl.trainer.config.CheckpointConfig
false
["model","optimizer","extra"]
["model","optimizer","extra"]
0.2
3
0.2
0.2
false
0
false
verl.workers.config.FSDPEngineConfig
false
-1
false
false
false
true
0
1
0.001
low_var_kl
token-mean
verl.workers.config.FSDPOptimizerConfig
0.000001
-1
0
0
0.5
435
constant
0.01
verl.workers.config.PolicyLossConfig
1
0.0002
5
0.0002
vanilla
0.1
1
16384
32
256
1-4
of 4