OmniQuant is a simple and powerful quantization technique for LLMs.
Primary LanguagePython
No issues in this repository yet.