Parameter sharing, revisited (again)
We evaluate standard and homebrew techniques for parameter sharing in GPT-like language models on the OpenWebText dataset. Conducted partly by Tim Dettmers, Aleksandr Borzunov, Michael Diskin, and Max Ryabinin.
Created on: 2022-04-26
Last edited: 2022-06-06