llama-cpp
There are 110 repositories under llama-cpp topic.
getumbrel/llama-gpt
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
SciSharp/LLamaSharp
A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.
Mobile-Artificial-Intelligence/maid
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.
withcatai/node-llama-cpp
Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level
gotzmann/llama.go
llama.go is like llama.cpp in pure Golang!
undreamai/LLMUnity
Create characters in Unity with LLMs!
the-crypt-keeper/can-ai-code
Self-evaluating interview for AI coders
withcatai/catai
Run AI ✨ assistant locally! with simple API for Node.js 🚀
mybigday/llama.rn
React Native binding of llama.cpp
mdrokz/rust-llama.cpp
LLama.cpp rust bindings
jlonge4/local_llama
This repo is to showcase how you can run a model locally and offline, free of OpenAI dependencies.
ptsochantaris/emeltal
Local ML voice chat using high-end models.
phronmophobic/llama.clj
Run LLMs locally. A clojure wrapper for llama.cpp.
gotzmann/booster
Booster - open accelerator for LLM models. Better inference and debugging for AI hackers
BrutalCoding/shady.ai
Making offline AI models accessible to all types of edge devices.
gpustack/gguf-parser-go
Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
nuance1979/llama-server
LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.
nrl-ai/CustomChar
Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.
R3gm/InsightSolver-Colab
InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.
vtuber-plan/langport
Langport is a language model inference service
robiwan303/babyagi
BabyAGI-🦙: Enhanced for Llama models (running 100% local) and persistent memory, with smart internet search based on BabyCatAGI and document embedding in langchain based on privateGPT
Abhi5h3k/PrivateDocBot
📚 Local PDF-Integrated Chat Bot: Secure Conversations and Document Assistance with LLM-Powered Privacy
OpenCSGs/llm-inference
llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployment, such as UI, RESTful API, auto-scaling, computing resource management, monitoring, and more.
lucasjinreal/Crane
A Pure Rust based LLM (Any LLM based MLLM such as Spark-TTS) Inference Engine, powering by Candle framework.
ystemsrx/Code-Atlas
A C++ implementation of Open Interpreter, based on llama.cpp. / Open Interpreter 的 C++ 实现,基于 llama.cpp
greynewell/musegpt
Local LLMs in your DAW!
rbourgeat/ImpAI
😈 ImpAI is an advanced role play app using large language and diffusion models.
fboulnois/llama-cpp-docker
Run llama.cpp in a GPU accelerated Docker container
hyparam/hyllama
llama.cpp gguf file parser for javascript
ossirytk/llama-cpp-chat-memory
Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma
tinyBigGAMES/Lumina
Local Generative AI
rbourgeat/llm-rp
✨ Your Custom Offline Role Play with LLM and Stable Diffusion on Mac and Linux (for now) 🧙♂️
countzero/windows_llama.cpp
PowerShell automation to rebuild llama.cpp for a Windows environment.
daskol/llama.py
Python bindings to llama.cpp
viniciusarruda/llama-cpp-chat-completion-wrapper
Wrapper around llama-cpp-python for chat completion with LLaMA v2 models.
JavaLLM/llama4j
An easy-to-use Java SDK for running LLaMA models on edge devices, powered by LLaMA.cpp