ReLU/tanh/Linear 4-->8; 8-->16 Runs
Created on November 4|Last edited on November 11
Comment
Overall results
Results obtained over 10 runs for 4 most interesting combinations of run.
Experimental settings: Adam with default settings; minibatches of size 32; training for 100 + 100 epochs. Initial parameters are all sampled from N(0, 1).
Types of initialization: We add a tiny bit of noise to all the matrices to break ties
- Random: Sample new matrix from N(0, 1)
- Random adjusted: Sample from N(\mu, \sigma) where \mu is mean of parent matrix, \sigma is std of parent matrix
- Permuted: Randomly shuffle entries in matrix
- Copy: Copy matrix
Expansion: Start 4-->8 or 8-->16
50
50
50
50
40
8 --> 16 (linear)
40
Add a comment