mega clip sweep
Created on April 30 | Last edited on May 6
A big sweep over more than 500 hyperparameter combinations. It varies the learning rate, plast clip (high plasticity), forget rate, and grad clip.
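To make the size of the sweep concrete, here is a minimal sketch of how a grid over these four hyperparameters expands into individual run configs. The parameter names (`learning_rate`, `plast_clip`, `forget_rate`, `grad_clip`) and every value in the lists are placeholders, not the values actually swept here. The list lengths (7, 6, 4, 3) are chosen only so the grid comes out to 504 combinations, which is consistent with "more than 500". It also assumes a full grid, which the report does not confirm.

```python
from itertools import product

# Hypothetical value grids: placeholders only, NOT the values used in this sweep.
grid = {
    "learning_rate": [1e-4, 3e-4, 1e-3, 3e-3, 1e-2, 3e-2, 1e-1],  # 7 values
    "plast_clip": [0.5, 1.0, 2.0, 5.0, 10.0, 20.0],               # 6 values
    "forget_rate": [0.0, 0.001, 0.01, 0.1],                       # 4 values
    "grad_clip": [0.5, 1.0, 5.0],                                 # 3 values
}


def expand_grid(grid):
    """Return one config dict per combination of the listed values."""
    names = list(grid)
    value_lists = [grid[name] for name in names]
    return [dict(zip(names, values)) for values in product(*value_lists)]


configs = expand_grid(grid)
print(len(configs))  # 7 * 6 * 4 * 3 combinations
print(configs[0])    # first combination: the first value from each list
```

Each dict in `configs` would correspond to one run. A full grid grows multiplicatively, so adding one more value to any list adds a whole slice of runs.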
[Plot panels not preserved. Run counts shown: Run set, 504; Run set 4, 504 (one entry shows 103).]