654 Scaling Laws with soft metrics

Created on January 16 | Last edited on January 16

[Plot: x-axis throughput/total_gflops (500G to 1.5T); y-axis ticks 0.85 to 0.95]
[Plot: x-axis Step (10k to 50k); y-axis ticks 0.8 to 1]
Run set: 6662