/kubeai

AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports LLMs, embeddings, and speech-to-text.

Primary LanguageGoApache License 2.0Apache-2.0

Issues