lucidrains/memory-efficient-attention-pytorch
Implementation of memory-efficient multi-head attention, as proposed in the paper "Self-attention Does Not Need O(n²) Memory".
Python · MIT License
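The core idea of the paper is to avoid ever materializing the full n × n attention matrix: keys and values are processed in chunks, and a running softmax maximum and normalizer are carried across chunks so the result matches standard attention exactly. Below is a minimal sketch of that technique in plain PyTorch; the function name `memory_efficient_attention` and the `k_chunk_size` argument are illustrative assumptions, not the API exposed by this repository.

```python
import torch

def memory_efficient_attention(q, k, v, k_chunk_size=1024):
    # q, k, v: (batch, heads, seq_len, dim_head)
    # Illustrative sketch of chunked attention with a streaming softmax;
    # not this repository's actual implementation or API.
    scale = q.shape[-1] ** -0.5
    q = q * scale

    out = torch.zeros_like(q)                                  # running weighted sum of values
    row_sum = q.new_zeros((*q.shape[:-1], 1))                  # running softmax denominator
    row_max = q.new_full((*q.shape[:-1], 1), float('-inf'))    # running max for numerical stability

    for k_chunk, v_chunk in zip(k.split(k_chunk_size, dim=-2),
                                v.split(k_chunk_size, dim=-2)):
        scores = q @ k_chunk.transpose(-1, -2)                 # (batch, heads, seq_len, chunk)
        chunk_max = scores.amax(dim=-1, keepdim=True)
        new_max = torch.maximum(row_max, chunk_max)

        exp_scores = (scores - new_max).exp()
        correction = (row_max - new_max).exp()                 # rescale previous accumulators

        out = out * correction + exp_scores @ v_chunk
        row_sum = row_sum * correction + exp_scores.sum(dim=-1, keepdim=True)
        row_max = new_max

    return out / row_sum

# usage: the full (seq_len x seq_len) score matrix is never allocated at once
q = torch.randn(1, 8, 4096, 64)
k = torch.randn(1, 8, 4096, 64)
v = torch.randn(1, 8, 4096, 64)
out = memory_efficient_attention(q, k, v, k_chunk_size=512)    # (1, 8, 4096, 64)
```

Because earlier accumulators are rescaled by `exp(old_max - new_max)` before each new chunk is added, the final `out / row_sum` equals ordinary softmax attention, while the score matrix held in memory at any moment is only seq_len × k_chunk_size per head.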
Stargazers
- achuthasubhash (GUNTUR, INDIA)
- anishthite (@ClarosAI)
- ben-z (Canada)
- bratao (Escavador)
- ChristophAlt (Bayer)
- ChristophReich1996 (Technical University of Munich)
- dtch1997
- EmreTaha
- eugenesiow (Singapore)
- evdcush
- fly51fly (PRIS)
- hushell
- hysts
- JankinTian
- lkwq007 (CUHK)
- lu-m13 (Intel Labs China)
- mirzask (Kansas City, MO)
- monatis (@qdrant)
- mosh98 (Tietoevry)
- mulkong (NAVER Z)
- nxznm (Nanjing University)
- Question406 (UCSB)
- sailfish009 (freelancer)
- shyamsn97
- slyviacassell
- sriharsha0806 (fractal)
- StillerPatrick (Helmholtz AI)
- styler00dollar
- theblackcat102 (iKala)
- theerfan (Los Angeles, CA)
- vicgalle (Komorebi AI & ICMAT-CSIC)
- WelkinYang (Netease FuxiAI)
- xchanmolx (Manigos Information Technology Solutions)
- XiSHEN0220 (Intellindust)
- yuhangzang (Shanghai AI Laboratory)
- zxczrx123 (Deeproute.ai)