llama

There are 962 repositories under llama topic.

  • ollama/ollama

    Get up and running with Llama 3, Mistral, Gemma, and other large language models.

    Language:Go72.7k4413.1k5.4k
  • llama.cpp

    ggerganov/llama.cpp

    LLM inference in C/C++

    Language:C++60k5083.2k8.5k
  • chatchat-space/Langchain-Chatchat

    Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain

    Language:Python28.8k2673.3k5.1k
  • LLaMA-Factory

    hiyouga/LLaMA-Factory

    Unify Efficient Fine-Tuning of 100+ LLMs

    Language:Python24k1643.8k3k
  • LocalAI

    mudler/LocalAI

    :robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.

    Language:C++21.1k1587081.6k
  • vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Language:Python20.8k1963k2.9k
  • Chinese-LLaMA-Alpaca

    ymcui/Chinese-LLaMA-Alpaca

    中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

    Language:Python17.7k1847281.8k
  • haotian-liu/LLaVA

    [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

    Language:Python17.4k1561.3k1.9k
  • GaiZhenbiao/ChuanhuChatGPT

    GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

    Language:Python14.9k857732.3k
  • cocktailpeanut/dalai

    The simplest way to run LLaMA on your local machine

    Language:CSS13.1k1483801.4k
  • LlamaFamily/Llama-Chinese

    Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

    Language:Python12.6k1423191.1k
  • HqWu-HITCS/Awesome-Chinese-LLM

    整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

  • PaddleNLP

    PaddlePaddle/PaddleNLP

    👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.

    Language:Python11.6k1033.5k2.9k
  • unslothai/unsloth

    Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

    Language:Python11.3k80470723
  • ludwig

    ludwig-ai/ludwig

    Low-code framework for building custom LLMs, neural networks, and other AI models

    Language:Python10.9k1931.1k1.2k
  • getumbrel/llama-gpt

    A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

    Language:TypeScript10.5k81125660
  • meta-llama/llama-recipes

    Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

    Language:Jupyter Notebook10.1k832831.4k
  • bentoml/OpenLLM

    Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.

    Language:Python9.1k55251581
  • bigscience-workshop/petals

    🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

    Language:Python8.8k89187478
  • TheR1D/shell_gpt

    A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.

    Language:Python8.6k81292681
  • LianjiaTech/BELLE

    BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

    Language:HTML7.7k107438739
  • SJTU-IPADS/PowerInfer

    High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

    Language:C++7.1k75135378
  • Chinese-LLaMA-Alpaca-2

    ymcui/Chinese-LLaMA-Alpaca-2

    中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

    Language:Python7k75382567
  • zilliztech/GPTCache

    Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

    Language:Python6.6k57155460
  • reorproject/reor

    Private & local AI personal knowledge management app.

    Language:TypeScript6.4k38121367
  • baichuan-inc/Baichuan-7B

    A large-scale 7B pretraining language model developed by BaiChuan-Inc.

    Language:Python5.6k66127500
  • serge

    serge-chat/serge

    A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.

    Language:Svelte5.6k54179398
  • k8sgpt-ai/k8sgpt

    Giving Kubernetes Superpowers to everyone

    Language:Go5.1k53259564
  • yangjianxin1/Firefly

    Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

    Language:Python5k52247462
  • SCIR-HI/Huatuo-Llama-Med-Chinese

    Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调

    Language:Python4.4k44103431
  • Facico/Chinese-Vicuna

    Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca

    Language:C4.1k58244427
  • Instruction-Tuning-with-GPT-4/GPT-4-LLM

    Instruction Tuning with GPT-4

    Language:HTML4k4533296
  • arcee-ai/mergekit

    Tools for merging pretrained large language models.

    Language:Python3.8k47237330
  • h2oai/h2o-llmstudio

    H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/

    Language:Python3.7k45353390
  • langchain4j/langchain4j

    Java version of LangChain

    Language:Java3.7k70524699
  • lyogavin/Anima

    33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU

    Language:Jupyter Notebook3.4k98131281