Skip to main content

main v. main

main @fe33681/Fix ordering of ppo epoch iteration/2023-07-13 main @1446523/Update README.md/2023-07-10
Created on July 13|Last edited on July 13

ilql_randomwalks/GPT2Config/1gpu


050100150200Step5060708090
050100150200Step406080100
050100150200Step00.20.40.60.8
Run set
2



Run set
2


ilql_sentiments/gpt2/1gpu


Run set
2



Run set
2


ppo_sentiments_t5/t5-imdb/1gpu


Run set
2



Run set
2


ppo_hh/pythia-6B-static-sft/7gpus


Run set
2



Run set
2


ppo_randomwalks/randomwalks/1gpu


Run set
2



Run set
2


ppo_sentiments/gpt2-imdb/1gpu


Run set
2



Run set
2


sft_sentiments/gpt2/1gpu


Run set
2



Run set
2