/heinsen_attention

Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)

Primary LanguagePythonMIT LicenseMIT

Stargazers