
fix-kl-controller v. main

fix-kl-controller @ee8cd4a / fix(ppo_trainer): update `AdaptiveKLController` with correct KL / 2023-03-08
main @adbf8fc / Add intermediate checkpointing to `accelerate` trainers (#349) / 2023-03-08
Created on March 13 | Last edited on March 13

ppo_hh/pythia-6B-static-sft/7gpus


[Plot: x-axis Step, 0 to 1.4k; y-axis values from -2.3 to -1.9]


ilql_sentiments/gpt2/1gpu




ppo_sentiments/gpt2-imdb/1gpu




sft_sentiments/gpt2/1gpu




ppo_randomwalks/randomwalks/1gpu




ilql_randomwalks/GPT2Config/1gpu

