LR-finder approaches face-off
Comparison of effectiveness of LR-finder suggestions for NLP tasks on GLUE benchmark
Created on May 5|Last edited on May 10
Comment
SST2
Binary classification task. Dataset size: train - 67k, valid - 872. Fine-tuning whole model (no "freezing").
The results are averaged across 5 independent lr_find() calls. To explore consistency over runs one can ungroup the results for certain method and compare individual runs.
Run set
20
CoLA
Binary classification task. Dataset size: train - 8.5k, valid - 1k.
Run set
20
MRPC
Binary classification task. Dataset size: train - 3668, valid - 408.
Run set
20
Add a comment