exllama
There are 12 repositories under exllama topic.
gotzmann/booster
Booster - open accelerator for LLM models. Better inference and debugging for AI hackers
c0sogi/llama-api
An OpenAI-like LLaMA inference API
innightwolfsleep/text-generation-webui-telegram_bot
LLM telegram bot
shinomakoi/magi_llm_gui
A Qt GUI for large language models
shinomakoi/AI-Messenger
A QT GUI for large language models
silphendio/sliced_llama
Simple LLM inference server
NO-ob/simpleLlama
A Simple webserver for generating text with exllamav2
Aqirito/A.L.I.C.E
A.L.I.C.E (Artificial Labile Intelligence Cybernated Existence). A REST API of A.I companion for creating more complex system
seyf1elislam/LocalLLM_OneClick_Colab
Run gguf LLM models in Latest Version TextGen-webui
kooten111/EasyEXL
A Python script designed to streamline the process of quantizing models to exllamav2 format
alexkreidler/quote-constraint
A constrained generation filter for local LLMs that makes them quote properly from a source document
countzero/windows_exllama
This is a playground to explore the ExLlama project in a Windows environment.