Sinkhorn Transformer - Usable implementation of Sparse Sinkhorn Attention
Primary Language: Python. License: MIT.