Pinned Repositories
RAG
attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
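The attention_sinks repo builds on the "attention sink" observation from the StreamingLLM line of work: if the key/value cache keeps the first few tokens (the sinks) plus a sliding window of recent tokens, memory stays constant no matter how far generation runs past the training length. Below is a minimal sketch of that eviction policy; the class and parameter names (SinkKVCache, sink_size, window_size) are illustrative, not the library's actual API.

```python
from collections import deque


class SinkKVCache:
    """Hypothetical sketch of a sink + sliding-window KV cache.

    Keeps the first `sink_size` entries forever (the attention sinks)
    plus at most `window_size` of the most recent entries, so total
    memory is bounded by sink_size + window_size regardless of how
    many tokens have been generated.
    """

    def __init__(self, sink_size: int = 4, window_size: int = 1020):
        self.sink_size = sink_size
        self.sinks: list = []  # first tokens, never evicted
        self.window: deque = deque(maxlen=window_size)  # oldest evicted first

    def append(self, kv_entry) -> None:
        # Fill the sink slots first; afterwards the deque's maxlen
        # evicts the oldest non-sink entry automatically.
        if len(self.sinks) < self.sink_size:
            self.sinks.append(kv_entry)
        else:
            self.window.append(kv_entry)

    def entries(self) -> list:
        # Order matters: sinks come first, then the recent window,
        # matching the positions the model attends over.
        return self.sinks + list(self.window)


cache = SinkKVCache(sink_size=4, window_size=8)
for token_id in range(20):
    cache.append(token_id)
# The first 4 tokens are retained as sinks; only the 8 most recent follow.
print(cache.entries())  # [0, 1, 2, 3, 12, 13, 14, 15, 16, 17, 18, 19]
```

Because the cache size is fixed, attention cost per generated token is constant, which is what lets an existing model stream far past its original context length without retraining.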