/efficient-attention

[EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling

Primary language: Python. License: Apache License 2.0 (Apache-2.0)
