opengear-project/GEAR

GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM

Python, MIT

Readme
19 Issues
146 Stargazers
1 Watcher

Watchers

HaoKang-Timmy
Georgia Institute of Technology
