willsamu/worker-vllm
The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
PythonMIT
No issues in this repository yet.
The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
PythonMIT
No issues in this repository yet.