/smoothquant

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Primary LanguagePythonMIT LicenseMIT

Watchers

No one’s watching this repository yet.