/awesome-local-llms

Compare open-source local LLM inference projects by their metrics to assess popularity and activeness.

👋 Awesome Local LLMs

There are an overwhelming number of open-source tools for local LLM inference - for both proprietary and open weights LLMs. These tools generally lie within three categories:

  1. LLM inference backend engine
  2. LLM front end UI
  3. All-in-one desktop application

However these tools can overlap in scope with new features are constantly being added so I have chosen not to manually categorize or label features of each project.

GitHub repository metrics, like number of stars, contributors, issues, releases, and time since last commit, have been collected as a proxy for popularity and active maintenance.

Contributions are welcome! Feel free to suggest open-source repos that I have missed either in the Issues of this repo or run the script in the script branch and update the README and make a pull request.

For full table with all metrics go to this Google Sheet.

For my thoughts on local LLM tooling: https://vinlam.com/posts/local-llm-options/

Note the condensed table below has two filters applied:

  1. Repositories need more than 100 stars
  2. Repositories require a commit within the last 60 days

Open-Source Local LLM Projects

Last Updated: 12/06/2024

# Repo About Stars Forks Issues Contributors Releases License Time Since Last Commit
1 transformers 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. 127,442 25,261 1,112 433 151 Apache License 2.0 0 days, 8 hrs, 1 mins
2 ollama Get up and running with Llama 3, Mistral, Gemma, and other large language models. 72,809 5,390 976 239 65 MIT License 0 days, 12 hrs, 30 mins
3 ChatGPT-Next-Web A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。 71,529 57,065 333 183 60 MIT License 2 days, 0 hrs, 44 mins
4 gpt4all gpt4all: run open-source LLMs anywhere 65,760 7,241 430 101 16 MIT License 0 days, 13 hrs, 5 mins
5 gpt_academic 为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。 60,166 7,514 270 85 29 GNU General Public License v3.0 1 days, 21 hrs, 12 mins
6 llama.cpp LLM inference in C/C++ 60,069 8,554 569 473 1,937 MIT License 0 days, 8 hrs, 7 mins
7 gpt4free The official gpt4free repository, various collection of powerful language models 58,683 13,144 52 204 140 GNU General Public License v3.0 0 days, 20 hrs, 44 mins
8 privateGPT Interact with your documents using the power of GPT, 100% privately, no data leaks 52,600 7,054 263 75 7 Apache License 2.0 0 days, 11 hrs, 10 mins
9 text-generation-webui A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models. 37,649 5,017 195 305 41 GNU Affero General Public License v3.0 1 days, 15 hrs, 27 mins
10 lobe-chat 🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT chat application. 32,395 7,757 373 116 688 MIT License 0 days, 8 hrs, 29 mins
11 chatbot-ui AI chat for every model. 26,988 7,461 143 47 0 MIT License 2 days, 16 hrs, 45 mins
12 open-webui User-friendly WebUI for LLMs (Formerly Ollama WebUI) 26,615 2,850 121 147 33 MIT License 0 days, 9 hrs, 18 mins
13 LocalAI 🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities. 21,092 1,609 302 96 48 MIT License 0 days, 15 hrs, 15 mins
14 vllm A high-throughput and memory-efficient inference and serving engine for LLMs 20,855 2,882 1,145 358 27 Apache License 2.0 0 days, 9 hrs, 13 mins
15 jan Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM) 19,900 1,150 165 48 24 GNU Affero General Public License v3.0 0 days, 23 hrs, 48 mins
16 localGPT Chat with your documents on your local device using GPT models. No data leaves your device and 100% private. 19,427 2,165 466 42 0 Apache License 2.0 17 days, 14 hrs, 52 mins
17 chatbox User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...) 19,360 1,969 290 28 62 GNU General Public License v3.0 3 days, 19 hrs, 13 mins
18 mlc-llm Universal LLM Deployment Engine with ML Compilation 17,504 1,390 129 116 1 Apache License 2.0 0 days, 11 hrs, 2 mins
19 llamafile Distribute and run LLMs with a single file. 16,336 811 101 40 21 Other 4 days, 6 hrs, 43 mins
20 anything-llm The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities. 15,859 1,665 157 49 0 MIT License 0 days, 11 hrs, 22 mins
21 ChuanhuChatGPT GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI. 14,943 2,259 121 49 21 GNU General Public License v3.0 14 days, 17 hrs, 49 mins
22 LibreChat Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development 12,300 2,203 90 122 41 MIT License 0 days, 21 hrs, 33 mins
23 web-llm High-performance In-browser LLM Inference Engine 11,127 692 45 33 1 Apache License 2.0 0 days, 9 hrs, 5 mins
24 h2ogpt Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/ 10,863 1,194 262 67 2 Apache License 2.0 0 days, 9 hrs, 9 mins
25 chathub All-in-one chatbot client 9,708 968 311 12 0 GNU General Public License v3.0 11 days, 2 hrs, 14 mins
26 OpenLLM Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud. 9,108 581 56 26 118 Apache License 2.0 0 days, 20 hrs, 10 mins
27 FlexGen Running large language models on a single GPU for throughput-oriented scenarios. 9,052 527 56 18 0 Apache License 2.0 53 days, 16 hrs, 5 mins
28 text-generation-inference Large Language Model Text Generation Inference 8,215 916 157 91 43 Apache License 2.0 0 days, 16 hrs, 52 mins
29 server The Triton Inference Server provides an optimized cloud and edge inferencing solution. 7,591 1,406 496 113 67 BSD 3-Clause "New" or "Revised" License 0 days, 8 hrs, 18 mins
30 TensorRT-LLM TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines. 7,100 762 612 13 6 Apache License 2.0 0 days, 8 hrs, 34 mins
31 llama-cpp-python Python bindings for llama.cpp 6,941 824 394 141 212 MIT License 1 days, 14 hrs, 37 mins
32 chat-ui Open source codebase powering the HuggingChat app 6,605 927 221 87 12 Apache License 2.0 0 days, 15 hrs, 13 mins
33 SillyTavern LLM Frontend for Power Users. 6,503 1,952 328 123 80 GNU Affero General Public License v3.0 0 days, 14 hrs, 10 mins
34 openplayground An LLM playground you can run on your laptop 6,128 477 84 16 0 MIT License 20 days, 11 hrs, 5 mins
35 big-agi Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud. 4,619 1,054 151 37 16 MIT License 0 days, 8 hrs, 41 mins
36 koboldcpp A simple one-file way to run various GGML and GGUF models with KoboldAI's UI 4,187 304 216 470 79 GNU Affero General Public License v3.0 0 days, 11 hrs, 21 mins
37 lollms-webui Lord of Large Language Models Web User Interface 3,992 506 142 38 21 Apache License 2.0 0 days, 11 hrs, 54 mins
38 llm Access large language models from the command-line 3,272 163 188 19 25 Apache License 2.0 9 days, 23 hrs, 32 mins
39 exllamav2 A fast inference library for running LLMs locally on modern consumer-class GPUs 3,112 228 110 39 26 MIT License 1 days, 10 hrs, 49 mins
40 inference Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop. 3,088 258 327 53 64 Apache License 2.0 0 days, 8 hrs, 1 mins
41 lmdeploy LMDeploy is a toolkit for compressing, deploying, and serving LLMs. 2,919 256 166 54 28 Apache License 2.0 0 days, 8 hrs, 15 mins
42 LLamaSharp A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently. 2,161 292 118 48 17 MIT License 0 days, 9 hrs, 3 mins
43 nitro Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM). Powers 👋 Jan 1,698 89 69 26 84 No license 0 days, 8 hrs, 24 mins
44 chatbot-ollama Chatbot Ollama is an open source chat UI for Ollama. 1,231 196 18 6 1 Other 53 days, 1 hrs, 34 mins
45 LLMFarm llama and other large language models on iOS and MacOS offline using GGML library. 1,014 60 10 1 28 MIT License 15 days, 20 hrs, 14 mins
46 maid Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely. 945 93 8 15 29 MIT License 0 days, 8 hrs, 24 mins
47 oterm a text-based terminal client for Ollama 827 40 9 9 18 MIT License 20 days, 17 hrs, 48 mins
48 page-assist Use your locally running AI models to assist you in your web browsing 766 90 52 8 8 MIT License 0 days, 16 hrs, 20 mins
49 amica Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition. 566 86 43 16 4 MIT License 0 days, 18 hrs, 50 mins
50 catai Run AI ✨ assistant locally! with simple API for Node.js 🚀 418 27 1 4 24 MIT License 29 days, 23 hrs, 34 mins
51 FreeChat llama.cpp based AI chat app for macOS 388 31 20 4 0 MIT License 13 days, 13 hrs, 18 mins
52 exui Web UI for ExLlamaV2 377 31 26 7 0 MIT License 2 days, 15 hrs, 40 mins
53 LocalAIVoiceChat Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis. 367 36 9 2 1 Other 6 days, 21 hrs, 45 mins
54 ava All-in-one desktop app for running LLMs locally. 358 15 9 3 0 Other 34 days, 14 hrs, 39 mins
55 tenere 🔥 TUI interface for LLMs written in Rust 254 9 1 7 12 GNU General Public License v3.0 1 days, 1 hrs, 46 mins
56 ChatterUI Simple frontend for LLMs built in react-native. 164 9 4 2 22 GNU Affero General Public License v3.0 1 days, 5 hrs, 18 mins
57 mikupad LLM Frontend in a single html file 153 22 20 9 13 Creative Commons Zero v1.0 Universal 2 days, 20 hrs, 28 mins
58 emeltal Local ML voice chat using high-end models. 123 6 0 1 0 MIT License 0 days, 13 hrs, 33 mins

Inspired By