integracore2/attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
Python · Apache-2.0
Stargazers
No one has starred this repository yet.