/LongRoPE

LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.

Primary LanguagePythonMIT LicenseMIT

Watchers