/EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers