/CoLT5-attention

Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch

Primary LanguagePythonMIT LicenseMIT

Stargazers