llm

There are 26577 repositories under llm topic.

  • ollama/ollama

    Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

    Language:Go152k8578.2k13.1k
  • transformers

    huggingface/transformers

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    Language:Python150k1.2k17.9k30.4k
  • langchain-ai/langchain

    🦜🔗 Build context-aware reasoning applications 🦜🔗

    Language:Jupyter Notebook115k7738.7k19k
  • dify

    langgenius/dify

    Production-ready platform for agentic workflow development.

    Language:TypeScript114k67913.3k17.5k
  • open-webui

    open-webui/open-webui

    User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

    Language:JavaScript110k4986.4k15.1k
  • LLMs-from-scratch

    rasbt/LLMs-from-scratch

    Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

    Language:Jupyter Notebook71.3k62218510.3k
  • browser-use

    browser-use/browser-use

    🌐 Make websites accessible for AI agents. Automate tasks online with ease.

    Language:Python69.9k3711.1k8.2k
  • infiniflow/ragflow

    RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

    Language:TypeScript64.3k2935.2k6.7k
  • All-Hands-AI/OpenHands

    🙌 OpenHands: Code Less, Make More

    Language:Python63.5k3762.5k7.6k
  • mlabonne/llm-course

    Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

  • FoundationAgents/MetaGPT

    🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

    Language:Python58.4k7.1k
  • LLaMA-Factory

    hiyouga/LLaMA-Factory

    Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

    Language:Python58.3k2907.4k7.2k
  • vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Language:Python58.1k43810.8k10.1k
  • firecrawl/firecrawl

    The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data 🔥

    Language:TypeScript57.5k4.8k
  • anything-llm

    Mintplex-Labs/anything-llm

    The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

    Language:JavaScript49.1k3402.8k5.1k
  • unsloth

    unslothai/unsloth

    Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

    Language:Python45.5k2622.5k3.7k
  • run-llama/llama_index

    LlamaIndex is the leading framework for building LLM-powered agents over your data.

    Language:Python44.3k2596.1k6.4k
  • llm-app

    pathwaycom/llm-app

    Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

    Language:Jupyter Notebook40.4k4617407
  • mem0ai/mem0

    Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.

    Language:Python39.9k2021.1k4.2k
  • zhayujie/chatgpt-on-wechat

    基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择ChatGPT/Claude/DeepSeek/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。

    Language:Python39k2752k9.4k
  • ray-project/ray

    Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

    Language:Python38.9k49121.1k6.8k
  • quivr

    QuivrHQ/quivr

    Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.

    Language:Python38.4k2841.5k3.7k
  • 2noise/ChatTTS

    A generative speech model for daily dialogue.

    Language:Python37.8k1946294.1k
  • milvus

    milvus-io/milvus

    Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

    Language:Go37.4k31514.2k3.4k
  • chatchat-space/Langchain-Chatchat

    Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

    Language:TypeScript36.1k2954.2k6k
  • LocalAI

    mudler/LocalAI

    :robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

    Language:Go35.3k2321.1k2.8k
  • cherry-studio

    CherryHQ/cherry-studio

    🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.

    Language:TypeScript33.1k1546.8k3k
  • khoj

    khoj-ai/khoj

    Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

    Language:Python31k1535531.8k
  • upstash/context7

    Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors

    Language:JavaScript30.3k1.5k
  • JushBJJ/Mr.-Ranedeer-AI-Tutor

    A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

  • continuedev/continue

    ⏩ Ship faster with Continuous AI. Build and run custom agents across your IDE, terminal, and CI

    Language:TypeScript28.9k1273.7k3.5k
  • BerriAI/litellm

    Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

    Language:Python28.9k1406.9k4.1k
  • microsoft/graphrag

    A modular graph-based Retrieval-Augmented Generation (RAG) system

    Language:Python28.1k1666832.9k
  • voideditor/void

    Language:TypeScript27k1595042k
  • semantic-kernel

    microsoft/semantic-kernel

    Integrate cutting-edge LLM technology quickly and easily into your apps

    Language:C#26.2k2974.7k4.2k
  • labring/FastGPT

    FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.

    Language:TypeScript25.8k1473.1k6.6k