turboderp

Pinned Repositories

tabbyAPI
An OAI compatible exllamav2 API that's both lightweight and fast
Language:Python888 12 174106
exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
Language:Python4.1k 35 533306
exui
Web UI for ExLlamaV2
Language:JavaScript486 9 5546
alpaca_lora_4bit
Language:Python8 3 02
exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Language:Python2.8k 36 220221
formatron
Formatron empowers everyone to control the format of language models' output with minimal overhead.
Language:Python1 0 00
tabbyAPI
An OAI compatible exllamav2 API that's both lightweight and fast
Language:Python1 0 00
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (GGUF), Llama models.
Language:Python4 1 00
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python1 0 00

turboderp's Repositories

turboderp/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Language:Python2.8k 36 220221
turboderp/alpaca_lora_4bit
Language:Python8 3 02
turboderp/text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (GGUF), Llama models.
Language:Python4 1 00
turboderp/formatron
Formatron empowers everyone to control the format of language models' output with minimal overhead.
Language:Python1 0 00
turboderp/tabbyAPI
An OAI compatible exllamav2 API that's both lightweight and fast
Language:Python1 0 00
turboderp/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python1 0 00
turboderp/lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
Language:Python0 0 00