/memory-efficient-attention

Memory Efficient Attention (O(sqrt(n))) for JAX and PyTorch
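
To make the idea in the description concrete, here is a minimal sketch of chunked softmax attention for a single query, written in plain Python. It is not the repository's code: the function names `naive_attention` and `chunked_attention`, the `chunk_size` parameter, and the default chunk size of about sqrt(n) are all assumptions made for illustration. The sketch processes keys and values one chunk at a time and keeps only a running maximum, a running normalizer, and a running weighted sum, so the full vector of n attention scores is never held in memory at once.

```python
import math


def naive_attention(q, keys, values):
    """Reference softmax attention for one query; materializes all n scores."""
    scores = [sum(a * b for a, b in zip(q, k)) for k in keys]
    m = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - m) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[j] for w, v in zip(weights, values)) / total
            for j in range(dim)]


def chunked_attention(q, keys, values, chunk_size=None):
    """Same result as naive_attention, but only one chunk of scores is live.

    chunk_size is a hypothetical parameter; defaulting it to about sqrt(n)
    is an assumption chosen to mirror the bound named in the description.
    """
    n = len(keys)
    if chunk_size is None:
        chunk_size = max(1, math.isqrt(n))
    dim = len(values[0])

    running_max = -math.inf   # largest score seen so far
    running_sum = 0.0         # sum of exp(score - running_max) so far
    acc = [0.0] * dim         # sum of exp(score - running_max) * value so far

    for start in range(0, n, chunk_size):
        k_chunk = keys[start:start + chunk_size]
        v_chunk = values[start:start + chunk_size]
        scores = [sum(a * b for a, b in zip(q, k)) for k in k_chunk]

        # Rescale earlier partial results to the new running maximum.
        new_max = max(running_max, max(scores))
        scale = math.exp(running_max - new_max)  # 0.0 on the first chunk
        running_sum *= scale
        acc = [a * scale for a in acc]

        # Fold this chunk into the running totals.
        for s, v in zip(scores, v_chunk):
            w = math.exp(s - new_max)
            running_sum += w
            for j in range(dim):
                acc[j] += w * v[j]
        running_max = new_max

    return [a / running_sum for a in acc]
```

Both functions should agree up to floating-point rounding, which makes the naive version a convenient check when experimenting with different chunk sizes.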

Primary language: Python
License: MIT