llama

There are 2121 repositories under llama topic.

  • ollama/ollama

    Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

    Language:Go152k8578.2k13.1k
  • LLaMA-Factory

    hiyouga/LLaMA-Factory

    Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

    Language:Python58.3k2907.4k7.2k
  • vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Language:Python58.1k43810.8k10.1k
  • unsloth

    unslothai/unsloth

    Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

    Language:Python45.5k2622.5k3.7k
  • aider

    Aider-AI/aider

    aider is AI pair programming in your terminal

    Language:Python37.4k1853.2k3.5k
  • chatchat-space/Langchain-Chatchat

    Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

    Language:TypeScript36.1k2954.2k6k
  • LocalAI

    mudler/LocalAI

    :robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

    Language:Go35.3k2321.1k2.8k
  • haotian-liu/LLaVA

    [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

    Language:Python23.5k1601.6k2.6k
  • fishaudio/fish-speech

    SOTA Open Source TTS

    Language:Python22.9k1195461.9k
  • HqWu-HITCS/Awesome-Chinese-LLM

    整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

  • yamadashy/repomix

    📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.

    Language:TypeScript19.2k51176855
  • Chinese-LLaMA-Alpaca

    ymcui/Chinese-LLaMA-Alpaca

    中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

    Language:Python18.9k1827321.9k
  • sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Language:Python17.9k1173.2k2.9k
  • meta-llama/llama-cookbook

    Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services

    Language:Jupyter Notebook17.9k1924382.6k
  • GaiZhenbiao/ChuanhuChatGPT

    GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

    Language:Python15.4k868012.3k
  • LlamaFamily/Llama-Chinese

    Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用

    Language:Python14.7k1483401.3k
  • cocktailpeanut/dalai

    The simplest way to run LLaMA on your local machine

    Language:CSS13.1k1483801.4k
  • PaddleNLP

    PaddlePaddle/PaddleNLP

    Easy-to-use and powerful LLM and SLM library with awesome model zoo.

    Language:Python12.8k1003.8k3.1k
  • AstrBotDevs/AstrBot

    ✨ 一站式 LLM 聊天机器人平台及开发框架 ✨ 支持 QQ、QQ频道、Telegram、企微、飞书、钉钉 | 知识库、MCP 服务器、OpenAI、DeepSeek、Gemini、硅基流动、月之暗面、Ollama、OneAPI、Dify

    Language:Python12.1k432.2k866
  • bentoml/OpenLLM

    Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

    Language:Python11.8k57273770
  • ludwig

    ludwig-ai/ludwig

    Low-code framework for building custom LLMs, neural networks, and other AI models

    Language:Python11.6k1931.1k1.2k
  • TheR1D/shell_gpt

    A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.

    Language:Python11.3k96351908
  • getumbrel/llama-gpt

    A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

    Language:TypeScript11k82129710
  • tensorzero

    tensorzero/tensorzero

    TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

    Language:Rust10.3k32537685
  • modelscope/ms-swift

    Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).

    Language:Python9.9k443.3k870
  • bigscience-workshop/petals

    🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

    Language:Python9.8k99208572
  • dataelement/bisheng

    BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.

    Language:TypeScript9.6k6103111.6k
  • langchain4j/langchain4j

    LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes implementing RAG, tool calling (including support for MCP), and agents easy. LangChain4j integrates seamlessly with various enterprise Java frameworks.

    Language:Java9k1061.6k1.7k
  • xorbitsai/inference

    Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

    Language:Python8.5k582.5k741
  • oumi-ai/oumi

    Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

    Language:Python8.5k6279644
  • SJTU-IPADS/PowerInfer

    High-speed Large Language Model Serving for Local Deployment

    Language:C++8.3k81197444
  • reorproject/reor

    Private & local AI personal knowledge management app for high entropy people.

    Language:JavaScript8.3k50209505
  • LianjiaTech/BELLE

    BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

    Language:HTML8.2k105443768
  • LostRuins/koboldcpp

    Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

    Language:C++8.2k821.1k535
  • zilliztech/GPTCache

    Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

    Language:Python7.8k56175564
  • Chinese-LLaMA-Alpaca-2

    ymcui/Chinese-LLaMA-Alpaca-2

    中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

    Language:Python7.2k78392569