/quantum-attention

(WIP) A Flash-Attention 3 counterpart that achieves peak performance on both Ampere and Hopper GPUs

MIT LicenseMIT

quantum-attention

A Flash-Attention 3 counterpart that achieves peak performance on both Ampere and Hopper GPUs