Skip to main content

fixed bench sweep

I've long suspected that there was smth wrong w backprop
Created on August 17|Last edited on August 18

does any backprop work?


Computing group metrics from first 36 groups
500k1M1.5M2Miter00.20.40.60.81
updater: dfa no_recur_lr3
updater: backprop no_recur_lr3
updater: dfa no_recur_lr4
updater: backprop no_recur_lr4
Run set
644
no_recur_lr3
160
no_recur_lr4
160

lr, plast_clip, updater

Run set
644
no_recur_lr3
160
no_recur_lr4
160

is it rly just about effective lr?

Run set
644
no_recur_lr3
160
no_recur_lr4
160

updater vs. recurrence. We hope to see solid performance on both dfa and backprop even without recurrence.

Run set
644

is clip_weights useful at all, and if so, what value of it?

Run set
132
no_recur
320

is residual_connections a useful parameter?

Run set
132
no_recur
64