half-exp-fix v. main
half-exp-fix @1a243ae: fixes half exp not implemented error (2023-03-13)
main @ded2e5e: fix(ppo_trainer): update `AdaptiveKLController` with correct KL (#361) (2023-03-13)
Created on March 13 | Last edited on March 13
sft_sentiments/gpt2/1gpu
ppo_sentiments/gpt2-imdb/1gpu
ilql_sentiments/gpt2/1gpu
ilql_randomwalks/GPT2Config/1gpu
ppo_hh/pythia-6B-static-sft/7gpus
ppo_randomwalks/randomwalks/1gpu