Reports
Created by
Created On
Last edited
Parameter sharing, revisited (again)
We evaluate standard and homebrew techniques for parameter sharing in GPT-like language models on the OpenWebText dataset. This work was conducted in part by Tim Dettmers, Aleksandr Borzunov, Michael Diskin, and Max Ryabinin.
1
2022-04-26