john-hewitt/backpacks-flash-attn
The original Backpack Language Model implementation, a fork of FlashAttention
PythonBSD-3-Clause
Issues
- 0
EOFError when np.load
#4 opened by Tangyiming205069 - 0
- 3
Hydra's cascading AttributeErrors
#3 opened by nuankw - 0