inference-platform

There are 2 repositories under the inference-platform topic.

  • bentoml/BentoML

    The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

    Language: Python
  • InftyAI/llmaz

    ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

    Language: Go