fla-org/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
Python · MIT License
Watchers
- blais (New York City)
- sourcesync (San Francisco)
- hannibalhuang (Shenzhen)
- wookayin (New York, NY)
- michalwols (New York)
- YangWang92
- lcy-seso (China)
- vBaiCai (Beijing, China)
- sodabeta7 (Cupertino)
- renll (Redmond, Washington)
- apointa
- zineos
- yzhangcs (Shanghai)
- dscamiss
- PengLU1101 (Canada)
- drkostas
- realamirhe (Tehran)
- cpuimage (Shantou, China)
- radarFudan (Singapore)
- liyanc (Austin, TX)
- sustcsonglin (Cambridge)
- LegendBC (Wuhan, China)
- MonadKai (Beijing, China)
- haojiwei
- 3outeille (France)
- vapavlo
- ghchris2021
- Ryu1845
- joshgong1977