/backpacks-flash-attn

The original Backpack Language Model implementation, a fork of FlashAttention

Primary language: Python
License: BSD 3-Clause "New" or "Revised" License (BSD-3-Clause)
