/flash-attention

Fast and memory-efficient exact attention

Primary Language: C++
License: Apache License 2.0 (Apache-2.0)
