glassroom/heinsen_attention
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
PythonMIT
Stargazers
- davidchern
- DefTruthStatistics Department of JNU
- dsx-aishanghai
- fheinsen
- iceychrisAugsburg University of Applied Sciences
- iTechX-stu
- JCBrouwerBlueGen.ai
- livelxwChina Jiangsu Wuxi
- LouChao98
- menegazzi
- PaulmzrUniversity of Chinese Academy of Sciences
- progerSupercomputer City
- radarFudanNUS
- Ryu1845
- Shomvel
- sidereiorKnowBeforeYouGo Inc.
- sustcsonglinMIT
- ultranity
- weigao266Shanghai, China
- yzhangcsSoochow University
- zhixuan-linUniversity of Montreal
- zivzoneNational Chaio tung univeristy