An implementation of Performer, a linear attention-based transformer, in PyTorch
Primary language: Python. License: MIT.