llama

There are 1450 repositories under llama topic.

ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Language:Go105k 607 5.3k8.4k
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++70k 558 4.2k10.1k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python37k 219 5.6k4.6k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python32.9k 271 5.8k5k
chatchat-space/Langchain-Chatchat
Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
Language:TypeScript32.7k 290 4k5.6k
mudler/LocalAI
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference
Language:Go27.4k 192 9052k
Aider-AI/aider
aider is AI pair programming in your terminal
Language:Python24k 157 2.3k2.2k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python20.9k 158 1.6k2.3k
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Language:Python19.8k 134 1.2k1.4k
ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Language:Python18.6k 183 7321.9k
fishaudio/fish-speech
SOTA Open Source TTS
Language:Python17.9k 111 4761.3k
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。
17.1k 216 271.6k
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
Language:Jupyter Notebook15.8k 204 3982.3k
GaiZhenbiao/ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
Language:Python15.3k 84 7982.3k
LlamaFamily/Llama-Chinese
Llama中文社区，Llama3在线体验和微调模型已开放，实时汇总最新Llama3学习资料，已将所有代码更新适配Llama3，构建最好的中文Llama大模型，完全开源可商用
Language:Python14.3k 149 3371.3k
cocktailpeanut/dalai
The simplest way to run LLaMA on your local machine
Language:CSS13.1k 148 3811.4k
PaddlePaddle/PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
Language:Python12.3k 105 3.7k3k
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
Language:Python11.3k 194 1.1k1.2k
getumbrel/llama-gpt
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
Language:TypeScript10.9k 82 128708
bentoml/OpenLLM
Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud.
Language:Python10.3k 56 270654
TheR1D/shell_gpt
A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.
Language:Python10k 95 328785
bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Language:Python9.3k 95 204525
dataelement/bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
Language:Python9.1k 908 1691.6k
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Language:C++8k 78 172419
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）
Language:HTML8k 108 442764
reorproject/reor
Private & local AI personal knowledge management app for high entropy people.
Language:TypeScript7.4k 47 203455
zilliztech/GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
Language:Python7.3k 58 173512
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
Language:Python7.1k 78 389579
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python6.9k 63 805632
k8sgpt-ai/k8sgpt
Giving Kubernetes Superpowers to everyone
Language:Go6.1k 59 299700
yangjianxin1/Firefly
Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Language:Python6k 55 281533
LostRuins/koboldcpp
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
Language:C++5.9k 69 849384
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
Language:Python5.8k 43 1.6k479
serge-chat/serge
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.
Language:Svelte5.7k 47 181404
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Language:Python5.7k 68 128506
lyogavin/airllm
AirLLM 70B inference with single 4GB GPU
Language:Jupyter Notebook5.5k 130 185440

llama

ollama/ollama

ggerganov/llama.cpp

hiyouga/LLaMA-Factory

vllm-project/vllm

chatchat-space/Langchain-Chatchat

mudler/LocalAI

Aider-AI/aider

haotian-liu/LLaVA

unslothai/unsloth

ymcui/Chinese-LLaMA-Alpaca

fishaudio/fish-speech

HqWu-HITCS/Awesome-Chinese-LLM

meta-llama/llama-recipes

GaiZhenbiao/ChuanhuChatGPT

LlamaFamily/Llama-Chinese

cocktailpeanut/dalai

PaddlePaddle/PaddleNLP

ludwig-ai/ludwig

getumbrel/llama-gpt

bentoml/OpenLLM

TheR1D/shell_gpt

bigscience-workshop/petals

dataelement/bisheng

SJTU-IPADS/PowerInfer

LianjiaTech/BELLE

reorproject/reor

zilliztech/GPTCache

ymcui/Chinese-LLaMA-Alpaca-2

sgl-project/sglang

k8sgpt-ai/k8sgpt

yangjianxin1/Firefly

LostRuins/koboldcpp

xorbitsai/inference

serge-chat/serge

baichuan-inc/Baichuan-7B

lyogavin/airllm