fix-ilql-sample-endings v. main
fix-ilql-sample-endings
@20eec76/fix(ilql_randomwalks): bump `seq_length` to not trim any samples/2023-03-10
main
@ded2e5e/fix(ppo_trainer): update `AdaptiveKLController` with correct KL (#361)/2023-03-13
Created on March 13|Last edited on March 13
Comment
ppo_hh/pythia-6B-static-sft/7gpus
ilql_sentiments/gpt2/1gpu
sft_sentiments/gpt2/1gpu
ppo_sentiments/gpt2-imdb/1gpu
ilql_randomwalks/GPT2Config/1gpu
ppo_randomwalks/randomwalks/1gpu
Add a comment