/sparse_quant_llms

SparseGPT + GPTQ compression of LLMs such as LLaMA, OPT, and Pythia

Primary Language: Python
