Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent, and chunkwise forward.
Primary LanguageJupyter NotebookMIT LicenseMIT
No one’s watching this repository yet.