/AutoGPTQ-triton

An easy-to-use model quantization package with user-friendly APIs, based on the GPTQ algorithm.

Primary language: Python. License: MIT.
