
Online DPO experiments for TL;DR summarisation

Created on August 28 | Last edited on August 29

Section 1


[Plot: x-axis train/global_step (ticks 500 to 2.5k), y-axis ticks 0 to 0.8]
Run set
2


