4-bit quantization of LLaMA using GPTQ
Primary Language: Python