lucidrains/memory-efficient-attention-pytorch

Implementation of memory-efficient multi-head attention, as proposed in the paper "Self-attention Does Not Need O(n²) Memory".

Python · MIT

6 Issues
370 Stargazers
9 Watchers

Watchers

bratao (Escavador)
drkostas (University of Tennessee, Knoxville)
dvaltchanov
eemailme
lucidrains (San Francisco)
michalwols (New York)
nbardy (San Francisco, CA)
purvang3 (Fremont, CA, USA)
vicgalle (Komorebi AI & ICMAT-CSIC)
