Pinned Repositories
tabbyAPI
An OAI compatible exllamav2 API that's both lightweight and fast
exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
exui
Web UI for ExLlamaV2
alpaca_lora_4bit
exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
formatron
Formatron empowers everyone to control the format of language models' output with minimal overhead.
tabbyAPI
An OAI compatible exllamav2 API that's both lightweight and fast
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (GGUF), Llama models.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
turboderp's Repositories
turboderp/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
turboderp/alpaca_lora_4bit
turboderp/text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (GGUF), Llama models.
turboderp/formatron
Formatron empowers everyone to control the format of language models' output with minimal overhead.
turboderp/tabbyAPI
An OAI compatible exllamav2 API that's both lightweight and fast
turboderp/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
turboderp/lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model