Plot: Init
Niklas Muennighoff
Created on June 23 | Last edited on August 29
[Plot: eval/c4_en-validation/CrossEntropyLoss vs. Step. x-axis: Step, ticks 50k to 300k; y-axis ticks 3 to 7. Runs: olmoe17-8x1b-fullshard-swiglu-wrapb-k2-init, olmoe17-8x1b-fullshard-swiglu-wrapb-k2-scratch]
[Plot: train/CrossEntropyLoss vs. Step. x-axis: Step, ticks 50k to 300k; y-axis ticks 4 to 10. Runs: olmoe17-8x1b-fullshard-swiglu-wrapb-k2-init, olmoe17-8x1b-fullshard-swiglu-wrapb-k2-scratch]