ggml
There are 88 repositories under the ggml topic.
ggerganov/llama.cpp
LLM inference in C/C++
rustformers/llm
[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need: run inference with any open-source language, speech-recognition, or multimodal model, whether in the cloud, on premises, or on your laptop.
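A minimal sketch of the "single line" claim, assuming a local Xinference server exposing an OpenAI-compatible endpoint; the base URL and model name below are assumptions, so substitute the values from your own deployment:

```typescript
import OpenAI from "openai";

// Point the stock OpenAI client at a local Xinference server instead of api.openai.com.
// Base URL and model name are assumptions about the deployment, not fixed values.
const client = new OpenAI({
  baseURL: "http://localhost:9997/v1", // assumed Xinference OpenAI-compatible endpoint
  apiKey: "not-needed-locally",        // placeholder; a local server typically ignores it
});

async function main() {
  const response = await client.chat.completions.create({
    model: "qwen2.5-instruct", // assumed: whichever model you launched in Xinference
    messages: [{ role: "user", content: "Summarize what GGUF is in one sentence." }],
  });
  console.log(response.choices[0].message.content);
}

main().catch(console.error);
```

The only change from a hosted-OpenAI setup is the client constructor line; the rest of the application code stays the same.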
LostRuins/koboldcpp
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
leejet/stable-diffusion.cpp
Stable Diffusion and Flux in pure C/C++
RWKV/rwkv.cpp
INT4/INT5/INT8 and FP16 inference on CPU for the RWKV language model
guinmoon/LLMFarm
Run LLaMA and other large language models offline on iOS and macOS using the GGML library.
RahulSChand/gpu_poor
Calculate tokens/s and GPU memory requirements for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
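As a rough illustration of the kind of estimate such a tool produces (a generic back-of-envelope formula, not gpu_poor's actual calculation), weight memory is roughly parameter count times bits per weight, plus KV-cache and runtime overhead:

```typescript
// Back-of-envelope GPU memory estimate for a quantized LLM.
// Generic rule of thumb, not gpu_poor's exact formula; overhead varies by runtime and context length.
function estimateMemoryGB(params: number, bitsPerWeight: number, overheadGB = 1.5): number {
  const weightsGB = (params * bitsPerWeight) / 8 / 1e9; // weight storage
  return weightsGB + overheadGB;                        // + KV cache / activations / runtime (assumed)
}

// A 7B model at 4-bit: ~3.5 GB of weights, ~5 GB total with the assumed overhead.
console.log(estimateMemoryGB(7e9, 4).toFixed(1), "GB");
```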
PABannier/bark.cpp
Suno AI's Bark model in C/C++ for fast text-to-speech generation
abacaj/mpt-30B-inference
Run inference on MPT-30B using CPU
Maknee/minigpt4.cpp
Port of MiniGPT-4 in C++ (4-bit, 5-bit, 6-bit, 8-bit, and 16-bit CPU inference with GGML)
azkadev/whisper
Whisper Dart is a cross-platform library for Dart and Flutter that converts audio to text (speech-to-text) by running inference on OpenAI's Whisper models
the-crypt-keeper/can-ai-code
Self-evaluating interview for AI coders
monatis/clip.cpp
CLIP inference in plain C/C++ with no extra dependencies
shm007g/LLaMA-Cult-and-More
Large language models for all: 🦙 Cult and More. Stay in touch!
azkadev/bark
WIP library: text-to-speech from Suno AI's Bark in C/C++ for fast inference
azkadev/general_ai
General AI library for Dart & Flutter
staghado/vit.cpp
Inference Vision Transformer (ViT) in plain C/C++ with ggml
mayooear/private-chatbot-mpt30b-langchain
Chat with your data privately using MPT-30b
balisujohn/tortoise.cpp
A ggml (C++) re-implementation of tortoise-tts
mgonzs13/llama_ros
llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2
abacaj/replit-3B-inference
Run inference on replit-3B code instruct model using CPU
chenhunghan/ialacol
🪶 Lightweight OpenAI drop-in replacement for Kubernetes
gotzmann/booster
Booster: an open accelerator for LLMs, with better inference and debugging for AI hackers
zatevakhin/obsidian-local-llm
Obsidian Local LLM is a plugin for Obsidian that provides access to a local LLM, allowing users to generate text in a wide range of styles and formats.
zhouwg/kantv
A workbench for learning and practising AI tech in real scenarios on Android devices, powered by GGML (Georgi Gerganov Machine Learning), NCNN (Tencent NCNN), and FFmpeg
guoriyue/LangCommand
LangCommand is a local inference command-line tool that transforms natural language descriptions into shell commands.
sevagh/demucs.cpp
C++17 port of Demucs v3 (hybrid) and v4 (hybrid transformer) models with ggml and Eigen3
nrl-ai/CustomChar
Your customized AI assistant: personal assistants on any hardware, built with llama.cpp, whisper.cpp, ggml, and LLaMA-v2.
rbourgeat/ImpAI
😈 ImpAI is an advanced role-play app using large language and diffusion models.
mgonzs13/whisper_ros
Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2
Mobile-Artificial-Intelligence/maid_llm
maid_llm is a Dart implementation of llama.cpp used by the Mobile Artificial Intelligence Distribution (Maid)
ahoylabs/gguf.js
A JavaScript library (with TypeScript types) for parsing the metadata of GGML-based GGUF files.
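For context, a GGUF file begins with a small fixed header (the magic "GGUF", a version, then tensor and metadata key/value counts) followed by the metadata such a library exposes. The sketch below reads just that header with Node's fs; it illustrates the on-disk layout, not gguf.js's actual API:

```typescript
import { readFileSync } from "node:fs";

// Read only the fixed GGUF header: magic, version, tensor count, metadata KV count.
// Assumes GGUF version 2 or later (version 1 used 32-bit counts); not the gguf.js API.
function readGgufHeader(path: string) {
  const buf = readFileSync(path);
  const magic = buf.toString("ascii", 0, 4);        // "GGUF"
  if (magic !== "GGUF") throw new Error("not a GGUF file");
  const version = buf.readUInt32LE(4);              // GGUF version (e.g. 3)
  const tensorCount = buf.readBigUInt64LE(8);       // number of tensors
  const metadataKvCount = buf.readBigUInt64LE(16);  // number of metadata key/value pairs
  return { magic, version, tensorCount, metadataKvCount };
}

console.log(readGgufHeader("model.gguf")); // "model.gguf" is a placeholder path
```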
seasonjs/stable-diffusion
Pure Go library for Stable Diffusion with cross-platform support.
cztomsik/ggml-js
JavaScript bindings for the GGML library
latestissue/AltaeraAI
A set of Bash scripts to automate deployment of GGML/GGUF models (default: RWKV) using KoboldCpp on Android via Termux