Skip to main content

hh-gpt-j

Created on March 17|Last edited on March 17
Reproduction of Max's (@reciprocate) HH example but saved with value heads.


Results


02k4k6k8k10kStep-2.5-2-1.5-1
Run: ppo_hh/gpt-j-6B/7gpus:ppo-hh
1



Run: ppo_hh/gpt-j-6B/7gpus:ppo-hh
1



Run: ppo_hh/gpt-j-6B/7gpus:ppo-hh
1



Run: ppo_hh/gpt-j-6B/7gpus:ppo-hh
1



Run: ppo_hh/gpt-j-6B/7gpus:ppo-hh
1



Run: ppo_hh/gpt-j-6B/7gpus:ppo-hh
1



Run: ppo_hh/gpt-j-6B/7gpus:ppo-hh
1



Run: ppo_hh/gpt-j-6B/7gpus:ppo-hh
1



Run: ppo_hh/gpt-j-6B/7gpus:ppo-hh
1



Run: ppo_hh/gpt-j-6B/7gpus:ppo-hh
1



Run: ppo_hh/gpt-j-6B/7gpus:ppo-hh
1