tensorchord/modelz-llm

feat: Support more models

Closed this issue · 3 comments

  • LLaMA
  • Bloomz
  • ChatGLM 6B (non-int4)
  • Vicuna
  • GPT-NeoX
  • StarCoder
  • MOSS

From the perspective of such a service, there is no difference between the quantized model and the original model: both expose the same interface.

So we just need to update the env vars to point at THUDM/chatglm-6b, and then it should work, right?

It should work. I used the int4 variant only because it is small enough to try out quickly.
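
For reference, here is a minimal sketch of what swapping the checkpoint looks like. The environment variable name `MODELZ_MODEL_NAME` is hypothetical (check the project's README for the actual one); the `transformers` calls themselves are the standard way to load ChatGLM-6B from its model card.

```python
# Sketch: load the checkpoint named by an env var, falling back to the
# full-precision ChatGLM-6B. MODELZ_MODEL_NAME is a hypothetical name.
import os

from transformers import AutoModel, AutoTokenizer

model_name = os.environ.get("MODELZ_MODEL_NAME", "THUDM/chatglm-6b")

# ChatGLM ships custom modeling code, so trust_remote_code is required.
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModel.from_pretrained(model_name, trust_remote_code=True).half().cuda()
model = model.eval()

# Quick smoke test. The int4 variant (THUDM/chatglm-6b-int4) exposes the
# same chat interface, which is why the service sees no difference.
response, history = model.chat(tokenizer, "Hello", history=[])
print(response)
```

Since both checkpoints share the same interface, only the env var has to change between the int4 and the full-precision deployment.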