huggingface/local-gemma

8-bit quantization

paolo-losi opened this issue · 1 comments

Would it be possible to support 8-bit quantization?

Hi @paolo-losi, we try to keep the number of args low, so we decided to go with 4-bit quantization for the memory preset. Is there an issue with the quality of the 4-bit model?
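For what it's worth, 8-bit loading is possible today by using 🤗 Transformers directly rather than local-gemma's presets. A minimal sketch with `BitsAndBytesConfig` (the checkpoint name here is just an example; requires `bitsandbytes` and a CUDA GPU):

```python
# Sketch: load a Gemma checkpoint with 8-bit weights via Transformers,
# bypassing local-gemma's preset system. Needs `bitsandbytes` installed
# and a CUDA-capable GPU.
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "google/gemma-2-9b-it"  # example checkpoint; adjust as needed

# load_in_8bit swaps linear layers for 8-bit (LLM.int8) equivalents
quantization_config = BitsAndBytesConfig(load_in_8bit=True)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=quantization_config,
    device_map="auto",
)
```

This trades some of the memory savings of 4-bit for the (usually smaller) quality loss of 8-bit, so it can serve as a workaround if the 4-bit model underperforms.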