Reports
Created by
Created On
Last edited
griffin arch on fineweb-1M
Comparing the effect of tokenizers on a small griffin arch, trained on a sample of the fineweb dataset.
1
2024-04-28
griffin aka recurrent_gemma arch
Some initial experiments with activation and layer count on simple_wikipedia_LM.
0
2024-04-25