SqueezeBits/QUICK

QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference

PythonMIT

Readme
6Issues
106Stargazers
1Watcher

Watchers

ghchris2021

Contact site admin: Geeks.