/long-short-transformer

Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch

Primary LanguagePythonMIT LicenseMIT

Watchers