ParaAttention

[WIP] Context parallel attention that works with torch.compile

Primary Language: Python | License: Other (NOASSERTION)
