Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
Primary LanguagePythonMIT LicenseMIT