half-exp-fix v. main
half-exp-fix @4b095df: Merge branch 'main' into half-exp-fix (2023-03-20)
main @79bfc6b: fix(docs): Update incorrect `PPORLElement` logprob tensor shape hint (#377) (2023-03-17)
Created on March 20 | Last edited on March 20
ppo_randomwalks/randomwalks/1gpu
sft_sentiments/gpt2/1gpu
ppo_hh/pythia-6B-static-sft/7gpus
ilql_sentiments/gpt2/1gpu
ilql_randomwalks/GPT2Config/1gpu
ppo_hh/gpt-j-6B/7gpus
ppo_sentiments/gpt2-imdb/1gpu