/streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Primary Language: Python
License: MIT
