/FLASH-pytorch

Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"

Primary LanguagePythonMIT LicenseMIT

Watchers