llm-inference

llm-inference is a platform for publishing and managing LLM inference, providing a wide range of out-of-the-box features for model deployment, such as a UI, a RESTful API, auto-scaling, compute resource management, monitoring, and more.
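
Since the platform exposes a RESTful API for deployed models, a client request might look like the minimal sketch below. The base URL, route (`/v1/completions`), and payload fields are illustrative assumptions for this example, not the project's documented interface; consult the repository's API documentation for the actual schema.

```python
import requests

# Hypothetical example of querying a model deployed on the platform.
# The address, route, and JSON fields below are assumptions for illustration.
BASE_URL = "http://localhost:8000"  # assumed local deployment address

def generate(prompt: str, model: str = "my-deployed-model") -> str:
    """Send a text-generation request to an assumed /v1/completions route."""
    resp = requests.post(
        f"{BASE_URL}/v1/completions",
        json={"model": model, "prompt": prompt, "max_tokens": 128},
        timeout=60,
    )
    resp.raise_for_status()  # surface HTTP errors from the serving layer
    return resp.json()["choices"][0]["text"]

if __name__ == "__main__":
    print(generate("Explain what auto-scaling means for LLM serving."))
```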

Primary Language: Python · License: Apache-2.0
