[NeurIPS 2024 Oralš„] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.
Primary LanguagePythonMIT LicenseMIT