[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.
Primary Language: Jupyter Notebook