MegEngine/InferLLM

chatglm2 GPU版本的int4、int8量化模型预测结果异常

Closed this issue · 1 comments

image

maybe there is a bug in an early commit, now it is ok in the main branch.