OmniQuant is a simple and powerful quantization technique for LLMs.
Primary LanguagePython
No one’s star this repository yet.