implemention of flash attention v1 and v2 with numpy
Primary LanguagePython
This is an implemention of flash attention with numpy, include V1 and V2