infichen's Stars
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
WeNeedHome/SummaryOfLoanSuspension
Summary of loan suspension notices from provinces and cities nationwide
wuba/dl_inference
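A general-purpose deep learning inference tool for quickly deploying models trained with the TensorFlow, PyTorch, and Caffe frameworks to production environments.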
tkestack/gpu-manager