Skip to main content

main v. main

main @8a943d9/Reset train dataloder at each iteration/2023-07-19 main @288d4cb/Fix: rename model_tok to tokenizer is reward_fn arg (#534)/2023-07-20
Created on July 20|Last edited on July 20

ppo_randomwalks/randomwalks/1gpu


050100150Step10203040506070
050100150Step0.20.40.60.8
050100150Step0.20.40.60.8
Run set
1



Run set
1


ilql_sentiments/gpt2/1gpu


Run set
1



Run set
1


sft_sentiments/gpt2/1gpu


Run set
1



Run set
1


ppo_sentiments_t5/t5-imdb/1gpu


Run set
1



Run set
1


ilql_randomwalks/GPT2Config/1gpu


Run set
1



Run set
1


ppo_sentiments/gpt2-imdb/1gpu


Run set
1



Run set
1


ppo_hh/pythia-6B-static-sft/7gpus


Run set
1



Run set
1