Skip to main content

value-branch v. main

value-branch @d48c1a4/Fix: Include num_value_layers_frozen arg in Seq2Seq model init/2023-07-26 main @e36fe9d/fix(modeling_ppo): load reference head under zero3 (#489)/2023-07-24
Created on July 29|Last edited on July 29

ppo_hh/pythia-6B-static-sft/7gpus


02004006008001k1.2k1.4kStep-1.6-1.4-1.2-1
Run set
2



Run set
2


ppo_sentiments_t5/t5-imdb/1gpu


Run set
1



Run set
1


ppo_sentiments/gpt2-imdb/1gpu


Run set
2



Run set
2


ppo_randomwalks/randomwalks/1gpu


Run set
2



Run set
2


ilql_randomwalks/GPT2Config/1gpu


Run set
2



Run set
2


sft_sentiments/gpt2/1gpu


Run set
2



Run set
2


ilql_sentiments/gpt2/1gpu


Run set
2



Run set
2