/quantum-attention

(WIP) A Flash-Attention 3 counterpart that achieves peak performance on both Ampere and Hopper GPUs

MIT License