/flashattention2-custom-mask

Triton implementation of FlashAttention2 that adds Custom Masks.

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers