fix-kl-controller v. main
fix-kl-controller
@ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08
main
@adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
Created on March 13|Last edited on March 13
Comment
ppo_randomwalks/randomwalks/1gpu
Run set
2
Run set
2
sft_sentiments/gpt2/1gpu
Run set
2
Run set
2
ppo_sentiments/gpt2-imdb/1gpu
Run set
2
Run set
2
ilql_sentiments/gpt2/1gpu
Run set
2
Run set
2
ilql_randomwalks/GPT2Config/1gpu
Run set
2
Run set
2
ppo_hh/pythia-6B-static-sft/7gpus
Run set
2
Run set
2
Add a comment