/RetentiveNetwork

Unofficial codebase for the "Retentive Network: A Successor to Transformer for Large Language Models" paper [https://arxiv.org/pdf/2307.08621.pdf]

Primary LanguagePythonMIT LicenseMIT

No issues in this repository yet.