/long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Primary language: Jupyter Notebook
License: Apache License 2.0 (Apache-2.0)
