Done with 10% of weights having the multiplied (larger) plasticity.
[Line chart: avg_loss]
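A minimal sketch of what "10% of weights having the multiplied plasticity" could look like in code. Everything here is an assumption for illustration: the names `make_plasticity` and `apply_update`, the random choice of which weights get boosted, the multiplier of 10, and the plain additive update are all hypothetical and not taken from the actual runs.

```python
import numpy as np

def make_plasticity(shape, fraction=0.10, multiplier=10.0, base=1.0, seed=0):
    """Build a per-weight plasticity array (hypothetical helper).

    A random `fraction` of the weights gets `base * multiplier`;
    all remaining weights keep `base`.
    """
    rng = np.random.default_rng(seed)
    n = int(np.prod(shape))
    k = int(round(fraction * n))          # number of boosted weights
    plasticity = np.full(n, base, dtype=np.float64)
    chosen = rng.choice(n, size=k, replace=False)
    plasticity[chosen] = base * multiplier
    return plasticity.reshape(shape)

def apply_update(weights, raw_update, plasticity, lr=0.01):
    """Scale each weight's update by its own plasticity (assumed update rule)."""
    return weights + lr * plasticity * raw_update

# Example: a 20x50 weight matrix with 10% of entries boosted.
plasticity = make_plasticity((20, 50), fraction=0.10, multiplier=10.0)
weights = np.zeros((20, 50))
weights = apply_update(weights, np.ones((20, 50)), plasticity)
```

Changing `fraction` to `0.01` gives the 1% variant; the real selection scheme and multiplier may differ.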
1% of weights
with decay
Call it a "treadmill". More comprehensive hyperparameter sweep involving all three values. The decay rule is now simplified to exclude the norm.
another go at it, something seems fishy.
whoa, check out the minima
benchmark against backprop