Skip to main content

654 Scaling Law

https://github.com/stanford-crfm/marin/issues/654
Created on December 11|Last edited on May 12

Section 1


70G80G90G100G200G300G400G500G600G700G800G900G1T2Tlog(gflops)0.90.951bpb
50G60G70G80G90G100G200Gthroughput/total_tokens0.90.951bpb
Run set
6




Run set
6



Run set
6