Pinned Repositories
bentoctl
Fast model deployment on any cloud 🚀
BentoDiffusion
BentoDiffusion: A collection of diffusion models served with BentoML
BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
BentoVLLM
Self-host LLMs with vLLM and BentoML
BentoVoiceAgent
Build Phone Calling Voice Agent fully powered by open source models.
comfy-pack
A comprehensive toolkit for reliably locking, packing and deploying environments for ComfyUI workflows.
gallery
BentoML Example Projects 🎨
OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
stable-diffusion-server
Deploy Your Own Stable Diffusion Service
Yatai
Model Deployment at Scale on Kubernetes 🦄️
BentoML's Repositories
bentoml/OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
bentoml/BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
bentoml/BentoDiffusion
BentoDiffusion: A collection of diffusion models served with BentoML
bentoml/comfy-pack
A comprehensive toolkit for reliably locking, packing and deploying environments for ComfyUI workflows.
bentoml/BentoVLLM
Self-host LLMs with vLLM and BentoML
bentoml/BentoOCR
Turn any OCR models into online inference API endpoint 🚀 🌖
bentoml/BentoVoiceAgent
Build Phone Calling Voice Agent fully powered by open source models.
bentoml/openllm-models
bentoml/BentoChatTTS
bentoml/BentoLMDeploy
Self-host LLMs with LMDeploy and BentoML
bentoml/yatai-image-builder
🐳 Build OCI images for Bentos in k8s
bentoml/yatai-deployment
🚀 Launching Bento in a Kubernetes cluster
bentoml/BentoWhisperX
bentoml/BentoCLIP
building a CLIP application using BentoML
bentoml/quickstart
BentoML Quickstart Example
bentoml/BentoLangGraph
Serving LangGraph Agent as REST API with BentoML, optionally with self-host open-source LLMs
bentoml/BentoFunctionCalling
bentoml/BentoXTTS
how to build an text-to-speech application using BentoML
bentoml/BentoMLCLLM
bentoml/BentoResnet
bentoml/BentoXTTSStreaming
xtts with streaming endpoint
bentoml/bentocloud-homepage-news
bentoml/BentoSGLang
bentoml/BentoShield
bentoml/LLMGateway
bentoml/yatai-common
bentoml/BentoMLflow
bentoml/helm-charts
bentoml/kantoku
A Process & Socket Manager built with zmq
bentoml/BentoXGBoost