Olmo 1B replication
Created on June 8 | Last edited on May 12
NB: we only log every 1000 steps from Olmo, which makes its training curve look much smoother than it would with denser logging.
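To illustrate why sparse logging makes a curve look smoother, here is a minimal sketch on synthetic data. It assumes that "logging every 1000 steps" means recording the raw loss value at every 1000th step (not an average over the interval). The loss curve, noise level, and all function names are hypothetical and are not taken from the runs in this report.

```python
import math
import random


def synthetic_loss(num_steps, seed=0):
    """Hypothetical noisy loss curve; NOT data from the runs in this report."""
    rng = random.Random(seed)
    return [
        2.0 + 8.0 * math.exp(-step / 2000) + rng.gauss(0.0, 0.3)
        for step in range(num_steps)
    ]


def log_every(losses, interval):
    """Keep only the value at every `interval`-th step (assumed logging scheme)."""
    return [(step, loss) for step, loss in enumerate(losses) if step % interval == 0]


def total_variation(values):
    """Sum of absolute changes between consecutive points: a rough wiggliness score."""
    return sum(abs(b - a) for a, b in zip(values, values[1:]))


dense = synthetic_loss(10_000)
sparse = [loss for _, loss in log_every(dense, 1000)]

print(f"points plotted: dense={len(dense)}, sparse={len(sparse)}")
print(
    f"total variation: dense={total_variation(dense):.1f}, "
    f"sparse={total_variation(sparse):.1f}"
)
```

Each sparsely logged point is just as noisy as any single dense point, but the step-to-step jitter between logged points is no longer drawn, so the plotted line has far less total variation. This is worth keeping in mind when comparing a sparsely logged curve against a densely logged one.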
Section 1
This set of panels contains runs from a private project, which cannot be shown in this report