Skip to main content

Snapshot Feb 24 2021, 2:18pm

GPT2_XL_pipe, regular adam. mp=2, pp=2, size reduced to match 1-bit adam model.
Created on February 24|Last edited on February 24

Section 1


5001k1.5k2kStep510152025
05001k1.5k2kStep5678910
5001k1.5k2kStep1e+92e+93e+94e+9
Run set
16



Run set
16