llamacpp

There are 591 repositories under llamacpp topic.

janhq/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
Language:TypeScript39.2k 206 3.1k2.4k
khoj-ai/khoj
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
Language:Python31.5k 154 5551.9k
llmware-ai/llmware
Unified framework for building enterprise RAG pipelines with small, specialized models
Language:Python14.4k 59 1653k
getumbrel/llama-gpt
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
Language:TypeScript11k 80 129711
LostRuins/koboldcpp
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
Language:C++8.9k 84 1.2k577
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
Language:Python8.7k 56 2.6k759
reorproject/reor
Private & local AI personal knowledge management app for high entropy people.
Language:JavaScript8.4k 53 215508
serge-chat/serge
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.
Language:Svelte5.8k 45 180401
JohnSnowLabs/spark-nlp
State of the Art Natural Language Processing
Language:Scala4.1k 98 908733
gptme/gptme
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.
Language:Python4.1k 41 271336
gpustack/gpustack
Simple, scalable AI model deployment on GPU clusters
Language:Python4k 37 1.7k399
cactus-compute/cactus
Kernels & AI inference engine for phones
Language:C++3.7k 31 70217
twinnydotdev/twinny
The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but 100% free.
Language:TypeScript3.6k 21 266212
SciSharp/LLamaSharp
A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.
Language:C#3.4k 64 446475
Michael-A-Kuykendall/shimmy
⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.
Language:Rust3.4k 31 100237
Josh-XT/AGiXT
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
Language:Python3.1k 68 484432
SilasMarvin/lsp-ai
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.
Language:Rust3k 23 72108
RunanywhereAI/runanywhere-sdks
Production ready toolkit to run AI locally
Language:Kotlin3k 14 10955
janhq/cortex.cpp
Local AI API Platform
Language:C++2.8k 27 887181
containers/ramalama
RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of containers.
Language:Python2.3k 34 468272
Mobile-Artificial-Intelligence/maid
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.
Language:Dart2.2k 34 189225
floneum/floneum
Instant, controllable, local pre-trained AI models in Rust
Language:Rust2.1k 26 125117
alexpinel/Dot
Text-To-Speech, RAG, and LLMs. All local!
Language:JavaScript1.8k 21 14109
mostlygeek/llama-swap
Reliable model swapping for any local OpenAI compatible server - llama.cpp, vllm, etc
Language:Go1.8k 12 193118
alexrozanski/LlamaChat
Chat with your favourite LLaMA models in a native macOS app
Language:Swift1.5k 15 4363
RahulSChand/gpu_poor
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
Language:JavaScript1.4k 7 1785
intentee/paddler
Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙
Language:Rust1.4k 10 4666
vercel/modelfusion
The TypeScript library for building AI applications.
Language:TypeScript1.3k 12 6691
awaescher/OllamaSharp
The easiest way to use Ollama in .NET
Language:C#1.2k 25 137170
benman1/generative_ai_with_langchain
Build production-ready LLM applications and advanced agents using Python, LangChain, and LangGraph. This is the companion repository for the book on generative AI with LangChain.
Language:Jupyter Notebook1.1k 19 67479
Dicklesworthstone/swiss_army_llama
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
Language:Python1k 16 665
ngxson/wllama
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
Language:TypeScript928 11 9958
Atome-FE/llama-node
Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llama/alpaca/gpt4all/vicuna/rwkv model.
Language:Rust870 14 7067
huggingface/llm-ls
LSP server leveraging LLMs for code completion (and more?)
Language:Rust829 21 3766
mukel/llama3.java
Practical Llama 3 inference in Java
Language:Java785 28 2093
if-ai/ComfyUI-IF_AI_tools
ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models.
Language:Python680 8 14053

llamacpp

janhq/jan

khoj-ai/khoj

llmware-ai/llmware

getumbrel/llama-gpt

LostRuins/koboldcpp

xorbitsai/inference

reorproject/reor

serge-chat/serge

JohnSnowLabs/spark-nlp

gptme/gptme

gpustack/gpustack

cactus-compute/cactus

twinnydotdev/twinny

SciSharp/LLamaSharp

Michael-A-Kuykendall/shimmy

Josh-XT/AGiXT

SilasMarvin/lsp-ai

RunanywhereAI/runanywhere-sdks

janhq/cortex.cpp

containers/ramalama

Mobile-Artificial-Intelligence/maid

floneum/floneum

alexpinel/Dot

mostlygeek/llama-swap

alexrozanski/LlamaChat

RahulSChand/gpu_poor

intentee/paddler

vercel/modelfusion

awaescher/OllamaSharp

benman1/generative_ai_with_langchain

Dicklesworthstone/swiss_army_llama

ngxson/wllama

Atome-FE/llama-node

huggingface/llm-ls

mukel/llama3.java

if-ai/ComfyUI-IF_AI_tools