Skip to main content
eleutherai
Projects
mesh-transformer-jax
Reports
Normalization shootout v2
Log in
Sign up
Share
Comment
Star
Share
Comment
Star
Normalization shootout v2
Ben Wang
Created on March 23
|
Last edited on May 4
Comment
Section 1
val/loss
val/loss
0
20k
40k
60k
80k
100k
120k
140k
Step
4
6
8
10
medium_scalenorm
medium_scalenorm_bias
medium_rmsnorm_bias
medium_rmsnorm
medium_layernormnobias
medium_layernorm
train/loss
train/loss
0
20k
40k
60k
80k
100k
120k
140k
Step
4
6
8
10
Run set
6
Run set
6
Add a comment