/sparse-structured-attention

Sparse and structured neural attention mechanisms

Primary language: Python
License: BSD 3-Clause "New" or "Revised" License (BSD-3-Clause)
