Quantization of models?
MilanBojic1999 opened this issue · 1 comment
MilanBojic1999 commented
Hi,
I am wondering if there are any plans to release quantized versions of the Chameleon models.
I'm working with an RTX 3060 (12 GB), and when I try to load the 7B model I get a CUDA out-of-memory error.
Thank you!
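For context, the out-of-memory error is expected with full-precision weights. A rough back-of-envelope estimate (weights only, ignoring activations, the KV cache, and CUDA overhead) shows why fp16 7B weights alone exceed a 12 GB card, while 4-bit weights would fit:

```python
# Back-of-envelope weight-memory estimate for a 7B-parameter model.
# Illustrative only: real usage also includes activations, KV cache,
# and framework/CUDA overhead.
PARAMS = 7e9
GIB = 1024**3  # bytes per GiB

def weight_gib(bits_per_param: float) -> float:
    """Approximate weight memory in GiB at the given precision."""
    return PARAMS * bits_per_param / 8 / GIB

fp16 = weight_gib(16)  # ~13.0 GiB -> more than a 12 GB card can hold
int4 = weight_gib(4)   # ~3.3 GiB  -> would fit with room to spare

print(f"fp16 weights: {fp16:.1f} GiB")
print(f"4-bit weights: {int4:.1f} GiB")
```

So even before activations are counted, the fp16 weights by themselves do not fit in 12 GB, which is why a quantized (e.g. 4-bit or 8-bit) checkpoint is what would make this card viable.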
lshamis commented
Unfortunately, we have no plans to release a quantized model in the near future.