attention_sinks

Extend existing LLMs far beyond their original training length, with constant memory usage and no retraining.
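The constant-memory behavior comes from the attention-sink cache policy: the KV cache always retains the first few tokens (the "sinks") plus a sliding window of the most recent tokens, so its size stays bounded no matter how long generation runs. A minimal sketch of that eviction rule in plain Python (the names `sink_size` and `window_size` and the function itself are illustrative assumptions, not this library's API):

```python
def evict(cache_positions, sink_size=4, window_size=1020):
    """Return the token positions kept in the KV cache.

    Keeps the first `sink_size` positions (attention sinks) plus the
    most recent `window_size` positions, so the cache never grows
    beyond sink_size + window_size entries.
    """
    if len(cache_positions) <= sink_size + window_size:
        return list(cache_positions)
    return list(cache_positions[:sink_size]) + list(cache_positions[-window_size:])

# As generation proceeds, memory stays bounded regardless of sequence length.
positions = list(range(5000))
kept = evict(positions)
assert len(kept) == 4 + 1020          # constant cache size
assert kept[:4] == [0, 1, 2, 3]       # sink tokens are never evicted
assert kept[-1] == 4999               # most recent token is retained
```

Evicting middle tokens while preserving the initial sink tokens is what lets attention remain stable past the training length; dropping the sinks as well degenerates into plain window attention, which is known to destabilize generation.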

Primary language: Python. License: Apache-2.0.
