A Flash-Attention 3 counterpart that achieves peak performance on both Ampere and Hopper GPUs
chengzeyi/quantum-attention
(WIP) A Flash-Attention 3 counterpart that achieves peak performance on both Ampere and Hopper GPUs
MIT
(WIP) A Flash-Attention 3 counterpart that achieves peak performance on both Ampere and Hopper GPUs
MIT