qwopqwop200/GPTQ-for-LLaMa

Porting GPTQ to CPU?

yiliu30 opened this issue · 2 comments

Is it possible to run GPTQ on a machine that has only CPUs? If not, is there a plan for it?

I believe you can use a GPTQ-quantized model with llama.cpp by running it through this conversion script.
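For reference, a minimal sketch of what CPU-only inference could look like once the model has been converted to a llama.cpp-compatible format. This uses the llama-cpp-python bindings; the bindings, model path, and parameter values here are illustrative assumptions, not something stated in this thread.

```python
# Minimal sketch: CPU-only inference with a model already converted to a
# llama.cpp-compatible format (e.g. via the conversion script mentioned above).
# The model path and parameters below are hypothetical.
from llama_cpp import Llama

llm = Llama(
    model_path="./models/llama-7b-quantized.bin",  # hypothetical path to the converted model
    n_ctx=512,        # context window size
    n_gpu_layers=0,   # keep all layers on the CPU
)

output = llm("Q: What is GPTQ? A:", max_tokens=64)
print(output["choices"][0]["text"])
```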

Hiwyl commented

Is it possible to just quantize the model on a CPU?