Skip to main content

different base models

Created on September 13|Last edited on September 18

5001k1.5kglobal_step012
exp_name: train_policy_accelerate Run set 2
exp_name: train_policy_accelerate, ppo.gradient_accumulation_steps: 1, base_model: EleutherAI/pythia-160m Run set
exp_name: train_policy_accelerate, ppo.gradient_accumulation_steps: 1, base_model: cerebras/Cerebras-GPT-111M Run set
exp_name: train_policy_accelerate, ppo.gradient_accumulation_steps: 64, base_model: gpt2 Run set
exp_name: train_policy_accelerate, ppo.gradient_accumulation_steps: 1, base_model: gpt2 Run set
5001k1.5kglobal_step051015
100200300400500Time (minutes)20406080100
100200300400500Time (minutes)20406080
Run set
19
Run set 2
5