Plot: Shared vs Dense
Niklas Muennighoff
Created on June 23 | Last edited on August 29
Red/23515: Shared; Green/23520: Dense
[Plot: eval/c4_en-validation/CrossEntropyLoss vs. Step (x-axis ticks 20k to 140k, y-axis ticks 3 to 5); runs 23520 (Dense) and 23515 (Shared)]
[Plot: train/CrossEntropyLoss vs. Step (x-axis ticks 20k to 140k, y-axis ticks 4 to 10); runs 23520 (Dense) and 23515 (Shared)]