Regular adam, mp=1, pp=2, ZeRO-1, 12 layers, 1 node
Shivanshu Purohit
Created on February 24 | Last edited on February 24
Section 1
[Plot: iteration_time vs. Step. x-axis: Step, ticks 500 to 2k; y-axis: iteration_time, ticks 5 to 15]
[Plot: lm loss vs. Step. x-axis: Step, ticks 0 to 2k; y-axis: lm loss, ticks 5 to 11]
[Plot: loss_scale vs. Step. x-axis: Step, ticks 500 to 2k; y-axis: loss_scale, ticks 1e+9 to 4e+9]
Run set: 8