
half-exp-fix v. main

half-exp-fix @1a243ae: fixes half exp not implemented error (2023-03-13)
main @ded2e5e: fix(ppo_trainer): update `AdaptiveKLController` with correct KL (#361) (2023-03-13)
Created on March 13 | Last edited on March 13

sft_sentiments/gpt2/1gpu


[Plot: x-axis Step, 0 to 1k; y-axis ticks from 0.55 to 0.8]

ppo_sentiments/gpt2-imdb/1gpu



ilql_sentiments/gpt2/1gpu



ilql_randomwalks/GPT2Config/1gpu



ppo_hh/pythia-6B-static-sft/7gpus



ppo_randomwalks/randomwalks/1gpu

