/FlashAttention.C

Flash Attention in raw Cuda C beating PyTorch

Primary LanguageCuda

Watchers