fblgit/AutoGPTQ-triton
An easy-to-use model quantization package with user-friendly APIs, based on the GPTQ algorithm.
Python · MIT
No issues in this repository yet.