/flash-cosine-sim-attention

Implementation of fused cosine similarity attention in the same style as Flash Attention

Primary LanguageCudaMIT LicenseMIT

Stargazers