/GPTQ-for-LLaMa-CUDA

A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.

Primary LanguagePythonApache License 2.0Apache-2.0

Stargazers