/OmniQuant

OmniQuant is a simple and powerful quantization technique for LLMs.

Primary Language: Python
