main v. main
main
@8a943d9/Reset train dataloder at each iteration/2023-07-19
main
@288d4cb/Fix: rename model_tok to tokenizer is reward_fn arg (#534)/2023-07-20
Created on July 20|Last edited on July 20
Comment
ppo_randomwalks/randomwalks/1gpu
ilql_sentiments/gpt2/1gpu
sft_sentiments/gpt2/1gpu
ppo_sentiments_t5/t5-imdb/1gpu
ilql_randomwalks/GPT2Config/1gpu
ppo_sentiments/gpt2-imdb/1gpu
ppo_hh/pythia-6B-static-sft/7gpus
Add a comment