riff on normal distributions
Created on September 11|Last edited on September 11
Comment
positive plasticity only.
Run set
9
gelu
Run set
9
use mse loss instead of crossentropy.
Run set
5
slower last layer, remove relu derivative calculation, since I'm using sigmoid rn.
Run set
8
even slower last layer
Run set
12
actually make the last layer faster
Run set
9
maybe fixed pos_only
Run set
6
go mini
Run set
6
Add a comment