Official code for Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM
Primary language: Python. License: MIT.