Initial run results
Created on October 25 | Last edited on November 16
Overall results
Results obtained over 20 runs of each of the eight configurations we considered.
Experimental settings: Adam with default settings (NB: this includes weight decay); minibatches of size 500; training for 500 epochs.
Expansion: start with 2 hidden units and expand to 4.
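The report does not say how the hidden layer is expanded, so the following is only a minimal sketch of one common option, under stated assumptions: a single tanh hidden layer, new units given small random incoming weights, and new outgoing weights set to zero so that the network output is unchanged at the moment of expansion. The names `expand_hidden` and `forward`, the `scale` parameter, and the choice of activation are all hypothetical and not taken from the experiments.

```python
import numpy as np


def forward(x, w_in, b_hidden, w_out):
    """One-hidden-layer network with tanh units (an assumed architecture)."""
    return np.tanh(x @ w_in.T + b_hidden) @ w_out.T


def expand_hidden(w_in, b_hidden, w_out, new_width, rng, scale=0.1):
    """Grow the hidden layer to `new_width` units.

    Assumption: new units get small random incoming weights and zero
    outgoing weights, so the output is identical right after expansion.
    """
    old_width, n_inputs = w_in.shape
    extra = new_width - old_width
    if extra < 0:
        raise ValueError("new_width must be at least the current width")
    new_w_in = np.vstack([w_in, scale * rng.standard_normal((extra, n_inputs))])
    new_b = np.concatenate([b_hidden, np.zeros(extra)])
    new_w_out = np.hstack([w_out, np.zeros((w_out.shape[0], extra))])
    return new_w_in, new_b, new_w_out


rng = np.random.default_rng(0)
w_in = rng.standard_normal((2, 3))   # 2 hidden units, 3 inputs (illustrative sizes)
b_hidden = np.zeros(2)
w_out = rng.standard_normal((1, 2))  # 1 output
x = rng.standard_normal((5, 3))

before = forward(x, w_in, b_hidden, w_out)
w_in4, b4, w_out4 = expand_hidden(w_in, b_hidden, w_out, new_width=4, rng=rng)
after = forward(x, w_in4, b4, w_out4)
```

With zero outgoing weights the expanded network starts from the same function as the 2-unit network; other initialisations (for example random outgoing weights) would change the output at expansion time.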
exp8-large (20 runs)
120
Effective rank
Run set: 160 runs
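The report does not state which definition of effective rank is plotted. As a hedged illustration, the sketch below uses one widely used definition: the exponential of the Shannon entropy of the singular values after normalising them to sum to one. The function name `effective_rank` and the `eps` threshold are assumptions made for this example, not details from the experiments.

```python
import numpy as np


def effective_rank(matrix, eps=1e-12):
    """Entropy-based effective rank of a matrix (assumed definition).

    Singular values are normalised to sum to one, and the result is
    exp(H), where H is the Shannon entropy of that distribution.
    """
    s = np.linalg.svd(np.asarray(matrix, dtype=float), compute_uv=False)
    total = s.sum()
    if total <= eps:
        return 0.0  # convention chosen here for an all-zero matrix
    p = s / total
    p = p[p > eps]  # drop numerically zero singular values before taking logs
    entropy = -(p * np.log(p)).sum()
    return float(np.exp(entropy))


full = effective_rank(np.eye(4))                       # equal singular values
low = effective_rank(np.outer([1.0, 2.0], [3.0, 4.0])) # a rank-one matrix
```

Under this definition a matrix with k equal nonzero singular values has effective rank k, and a rank-one matrix has effective rank 1; values in between indicate an uneven singular value spectrum.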
Most interesting runs; heatmaps
Run set: 4 runs