Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.
Primary LanguagePythonMIT LicenseMIT