4 bits quantization of LLaMa using GPTQ
Primary LanguagePython
No one’s watching this repository yet.