Skip to main content

half-exp-fix v. main

half-exp-fix @4b095df/Merge branch 'main' into half-exp-fix/2023-03-20 main @79bfc6b/fix(docs): Update incorrect `PPORLElement` logprob tensor shape hint (#377)/2023-03-17
Created on March 20|Last edited on March 20

ppo_randomwalks/randomwalks/1gpu


050100150Step10203040506070
050100150Step0.20.40.60.8
050100150Step0.20.40.60.8
Run set
2



Run set
2


sft_sentiments/gpt2/1gpu


Run set
2



Run set
2


ppo_hh/pythia-6B-static-sft/7gpus


Run set
0



Run set
0


ilql_sentiments/gpt2/1gpu


Run set
2



Run set
2


ilql_randomwalks/GPT2Config/1gpu


Run set
2



Run set
2


ppo_hh/gpt-j-6B/7gpus


Run set
0



Run set
0


ppo_sentiments/gpt2-imdb/1gpu


Run set
2



Run set
2