Skip to main content

Loss depends on batch?

Batches are randomized, so loss should not depend so strongly on batch ID... debugging.
Created on May 27|Last edited on May 27

Section 1


05k10k15k20k25k30kStep12345
Run: 1-bit-grad
1