Pinned Repositories
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Git-Github-notes-for-study
Study notes and reflections from learning Git and GitHub (Open Source Training Camp)
llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
java_project
CharlesRiggins's Repositories
CharlesRiggins/vllm