Resume from checkpoint
Created on July 29|Last edited on July 29
[Panel: run set, 1 run]
When resuming from the checkpoint, I would expect eval/loss to keep decreasing along its previous trend, but instead it suddenly increases for a while.
Here are a few runs with different warmup_steps values.
[Panel: run set, 5 runs]
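
Since these runs vary warmup_steps, one thing worth ruling out is whether the learning-rate schedule picks up where it left off on resume. The sketch below is only an illustration under stated assumptions, not the actual training setup: `lr_at`, `base_lr`, `total_steps`, and `checkpoint_step` are hypothetical names and values, and the linear warmup plus linear decay shape is assumed. It compares the learning rate a run would see if the schedule continues from the checkpoint step with the rate it would see if the schedule restarted at step 0.

```python
def lr_at(step, base_lr=5e-5, warmup_steps=100, total_steps=1000):
    """Hypothetical schedule: linear warmup to base_lr, then linear decay to 0."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


checkpoint_step = 500  # hypothetical step at which the checkpoint was saved

# Compare the first few steps after resuming under both behaviours.
for offset in (0, 50, 100):
    continued = lr_at(checkpoint_step + offset)  # schedule state restored
    restarted = lr_at(offset)                    # schedule restarted at step 0
    print(f"+{offset:>3} steps  continued={continued:.2e}  restarted={restarted:.2e}")
```

If the logged learning rate after resuming looks like the "restarted" column rather than the "continued" one, the schedule state is probably not being restored, which would be one candidate explanation for the temporary rise in eval/loss. Whether that applies here is not established by the runs shown.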