llmops

There are 267 repositories under llmops topic.

  • microsoft/autogen

    A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

    Language:Jupyter Notebook26k3451.2k3.7k
  • jina

    jina-ai/jina

    ☁️ Build multimodal AI applications with cloud-native stack

    Language:Python20.2k2081.9k2.2k
  • vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Language:Python19.6k1942.8k2.6k
  • SuperAGI

    TransformerOptimus/SuperAGI

    <⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

    Language:Python14.6k1723991.7k
  • BerriAI/litellm

    Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)

    Language:Python9k592.3k986
  • bentoml/OpenLLM

    Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.

    Language:Python8.9k52249566
  • phidatahq/phidata

    Build AI Assistants with memory, knowledge and tools.

    Language:Python8.1k561021.1k
  • infiniflow/ragflow

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

    Language:Python7.7k46388686
  • liguodongiot/llm-action

    本项目旨在分享大模型相关技术原理以及实战经验。

    Language:HTML6.6k6815652
  • BentoML

    bentoml/BentoML

    The most flexible way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Inference Graph/Pipelines, Compound AI systems, Multi-Modal, RAG as a Service, and more!

    Language:Python6.6k731k744
  • explodinggradients/ragas

    Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

    Language:Python5k27502441
  • gateway

    Portkey-AI/gateway

    A Blazing Fast AI Gateway. Route to 100+ LLMs with 1 fast & friendly API.

    Language:TypeScript4.8k34203315
  • superduperdb

    SuperDuperDB/superduperdb

    🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.

    Language:Python4.4k401.1k436
  • langfuse

    langfuse/langfuse

    🪢 Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

    Language:TypeScript3.7k12349351
  • zenml

    zenml-io/zenml

    ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.

    Language:Python3.7k40282404
  • giskard

    Giskard-AI/giskard

    🐢 Open-Source Evaluation & Testing for LLMs and ML models

    Language:Python3.2k26425207
  • tensorchord/Awesome-LLMOps

    An awesome & curated list of best LLMOps tools for developers

    Language:Shell3.1k579316
  • promptfoo/promptfoo

    Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.

    Language:TypeScript2.9k16383189
  • phoenix

    Arize-ai/phoenix

    AI Observability & Evaluation

    Language:Jupyter Notebook2.8k251.5k196
  • cube-studio

    tencentmusic/cube-studio

    cube studio开源云原生一站式机器学习/深度学习AI平台,支持sso登录,多租户/多项目组,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式

    Language:Jupyter Notebook2.6k68141485
  • llm-app

    pathwaycom/llm-app

    LLM App templates for RAG, knowledge mining, and stream analytics. Ready to run with Docker,⚡in sync with your data sources.

    Language:Dockerfile2.6k2713162
  • Josh-XT/AGiXT

    AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

    Language:Python2.5k58350328
  • iusztinpaul/hands-on-llms

    🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴

    Language:Jupyter Notebook2.4k449385
  • OpenPipe/OpenPipe

    Turn expensive prompts into cheap fine-tuned models

    Language:TypeScript2.4k2056124
  • dot-agent/nextpy

    🤖Self-Modifying Framework from the Future 🔮 World's First AMS

    Language:Python2.1k2633152
  • ianarawjo/ChainForge

    An open-source visual programming environment for battle-testing prompts to LLMs.

    Language:TypeScript2k24162146
  • uptrain-ai/uptrain

    UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.

    Language:Python2k19134171
  • tensorchord/envd

    🏕️ Reproducible development environment

    Language:Go1.9k22528154
  • pezzo

    pezzolabs/pezzo

    🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboration, troubleshooting, observability and more.

    Language:TypeScript1.8k2197169
  • microsoft/aici

    AICI: Prompts as (Wasm) Programs

    Language:Rust1.8k197472
  • truera/trulens

    Evaluation and Tracking for LLM Experiments

    Language:Jupyter Notebook1.7k17222146
  • bionic-gpt/bionic-gpt

    BionicGPT is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentiality

    Language:Rust1.6k20281155
  • predibase/lorax

    Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

    Language:Python1.6k29200110
  • DAGWorks-Inc/hamilton

    Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.

    Language:Jupyter Notebook1.4k1223086
  • traceloop/openllmetry

    Open-source observability for your LLM application, based on OpenTelemetry

    Language:Python1.3k67897
  • cognita

    truefoundry/cognita

    RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

    Language:Python1.3k171128