preemware/worker-vllm
The RunPod worker template for serving our large language model endpoints. Powered by VLLM.
PythonMIT
Stargazers
No one’s star this repository yet.
The RunPod worker template for serving our large language model endpoints. Powered by VLLM.
PythonMIT
No one’s star this repository yet.