/DuQuant

[NeurIPS 2024 OralšŸ”„] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.

Primary LanguagePythonMIT LicenseMIT

Issues