tomaarsen/attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

PythonApache-2.0

Readme
30Issues
684Stargazers
11Watchers

Watchers

eemailme
estibi
Poland
hugochoquet
online2311
pavaris-pm
@scamtify
planb788
tfburlingame
Global Public Safety
thevasudevgupta
@Unbox-AI
tomaarsen
Hugging Face
wDevil
Tinkoff
William-glitch-knight

Contact site admin: geeksiteservice@gmail.com.