Pinned Repositories
AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
chatglm.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
qwen.cpp
C++ implementation of Qwen-LM
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
zero_nlp
中文nlp应用(数据、模型、训练、推理) chatglm6b
zhangzai666's Repositories
zhangzai666/zero_nlp
中文nlp应用(数据、模型、训练、推理) chatglm6b