fixed bench sweep
I've long suspected that there was smth wrong w backprop
Created on August 17|Last edited on August 18
Comment
does any backprop work?
Computing group metrics from first 36 groups
lr, plast_clip, updater
is it rly just about effective lr?
updater vs. recurrence. We hope to see solid performance on both dfa and backprop even without recurrence.
is clip_weights useful at all, and if so, what value of it?
is residual_connections a useful parameter?
Add a comment