preemware/worker-vllm
The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
Python · MIT