/ApiQ

[EMNLP 2024] Quantize LLMs to extremely low bit-widths and finetune the quantized LLMs

Primary language: Python. License: MIT.
