llmops

There are 558 repositories under llmops topic.

  • vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Language:Python58.1k43710.8k10.1k
  • llm-app

    pathwaycom/llm-app

    Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

    Language:Jupyter Notebook40.6k71171.1k
  • BerriAI/litellm

    Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

    Language:Python28.9k1406.9k4.1k
  • ComposioHQ/composio

    Composio equips your AI agents & LLMs with 100+ high-quality integrations via function calling

    Language:TypeScript25.7k532654.4k
  • mlflow

    mlflow/mlflow

    The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

    Language:Python22k3164.6k4.8k
  • serve

    jina-ai/serve

    ☁️ Build multimodal AI applications with cloud-native stack

    Language:Python21.7k2151.9k2.2k
  • liguodongiot/llm-action

    本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

    Language:HTML20.8k140282.5k
  • SuperAGI

    TransformerOptimus/SuperAGI

    <⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

    Language:Python16.7k1754252.1k
  • langfuse

    langfuse/langfuse

    🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

    Language:TypeScript16.1k511.8k1.5k
  • raga-ai-hub/RagaAI-Catalyst

    Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view

    Language:Python16.1k28413.7k
  • comet-ml/opik

    Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

    Language:Python13.9k91421983
  • bentoml/OpenLLM

    Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

    Language:Python11.8k60279770
  • explodinggradients/ragas

    Supercharge Your LLM Application Evaluations 🚀

    Language:Python10.7k421.1k1.1k
  • tensorzero

    tensorzero/tensorzero

    TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

    Language:Rust10.3k32537685
  • dataelement/bisheng

    BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.

    Language:TypeScript9.6k6103111.6k
  • gateway

    Portkey-AI/gateway

    A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.

    Language:TypeScript9.5k45557731
  • metaflow

    Netflix/metaflow

    Build, Manage and Deploy AI/ML Systems

    Language:Python9.5k292731878
  • promptfoo/promptfoo

    Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

    Language:TypeScript8.4k20838697
  • BentoML

    bentoml/BentoML

    The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

    Language:Python8.1k801.1k874
  • phoenix

    Arize-ai/phoenix

    AI Observability & Evaluation

    Language:Jupyter Notebook7k373.3k570
  • evidentlyai/evidently

    Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

    Language:Jupyter Notebook6.6k52458723
  • traceloop/openllmetry

    Open-source observability for your LLM application, based on OpenTelemetry

    Language:Python6.4k8223799
  • tensorchord/Awesome-LLMOps

    An awesome & curated list of best LLMOps tools for developers

    Language:Shell5.3k819504
  • superduper

    superduper-io/superduper

    Superduper: End-to-end framework for building custom AI applications and agents.

    Language:Python5.2k431.4k527
  • coze-dev/coze-loop

    Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from development, debugging, and evaluation to monitoring.

    Language:Go4.9k641
  • zenml

    zenml-io/zenml

    ZenML 🙏: MLOps for Reliable AI: from Classical AI to Agents. https://zenml.io.

    Language:Python4.9k43395540
  • giskard-oss

    Giskard-AI/giskard-oss

    🐢 Open-Source Evaluation & Testing library for LLM Agents

    Language:Python4.9k37493355
  • cube-studio

    tencentmusic/cube-studio

    cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,mlops算法链路全流程,算力租赁平台,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU虚拟化,边缘计算,标注平台自动化标注,deepseek等大模型sft微调/奖励模型/强化学习训练,vllm/ollama/mindie大模型多机推理,私有知识库,AI模型市场,支持国产cpu/gpu/npu 昇腾生态,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/ray/volcano等分布式

    Language:Python4.6k77157804
  • Helicone/helicone

    🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓

    Language:TypeScript4.5k20187434
  • 0xPlaygrounds/rig

    ⚙️🦀 Build modular and scalable LLM Applications in Rust

    Language:Rust4.4k37129482
  • cognita

    truefoundry/cognita

    RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

    Language:Python4.2k3449354
  • decodingml/llm-twin-course

    🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴

    Language:Python4.1k7321688
  • PacktPublishing/LLM-Engineers-Handbook

    The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices

    Language:Python4.1k4225941
  • katanemo/archgw

    The smart edge and AI gateway for agents. Arch is a high-performance proxy server that handles the low-level work in building agents: like applying guardrails, routing prompts to the right agent, and unifying access to LLMs, etc. Natively designed to handle and process prompts, Arch helps you build agents faster.

    Language:Rust3.7k26108204
  • predibase/lorax

    Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

    Language:Python3.4k34264263
  • iusztinpaul/hands-on-llms

    🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴

    Language:Jupyter Notebook3.4k4821536