/multi-query-attention

Fast Transformer Decoding: One Write-Head is All You Need

Primary Language: Python
