fix-kl-controller v. main

fix-kl-controller @ee8cd4a: fix(ppo_trainer): update `AdaptiveKLController` with correct KL (2023-03-08)
main @adbf8fc: Add intermediate checkpointing to `accelerate` trainers (#349) (2023-03-08)
Created on March 10 | Last edited on March 10

ilql_randomwalks/GPT2Config/1gpu


[Plots: three panels, x-axis Step (0 to 150); y-axis ranges 0 to 0.4, 50 to 100, and 0 to 0.8]
Run set: 2





ppo_sentiments_t5/t5-imdb/1gpu


Run set: 0





ppo_randomwalks/randomwalks/1gpu


Run set: 2


