rti's Stars
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
scribe-org/Scribe-Android
Android app with keyboards for language learners
scribe-org/Scribe-iOS
iOS app with keyboards for language learners
scribe-org/Scribe-Desktop
Typing GUI for language learners on Windows, Mac and Linux
spotify/annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
runpod-workers/worker-vllm
The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
AbanteAI/mentat
Mentat - The AI Coding Assistant
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
fastai/lm-hackers
Hackers' Guide to Language Models
brevdev/notebooks
Collection of notebook guides created by the Brev.dev team!
gravitl/netmaker
Netmaker makes networks with WireGuard. Netmaker automates fast, secure, and distributed virtual networks.
intel-analytics/ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, MiniCPM, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, GraphRAG, DeepSpeed, vLLM, FastChat, Axolotl, etc.
jpalardy/vim-slime
A vim plugin to give you some slime. (Emacs)
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
mistralai/mistral-inference
Official inference library for Mistral models
numtide/treefmt
one CLI to format your repo [maintainers=@zimbatm,@brianmcgee]
juyongjiang/CodeUp
CodeUp: A Multilingual Code Generation Llama2 Model with Parameter-Efficient Instruction-Tuning on a Single RTX 3090
OpenAccess-AI-Collective/servereless-runpod-ggml
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
getumbrel/llama-gpt
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
abetlen/llama-cpp-python
Python bindings for llama.cpp
turboderp/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
TheBlokeAI/dockerLLM
TheBloke's Dockerfiles
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
huggingface/text-generation-inference
Large Language Model Text Generation Inference
jackMort/ChatGPT.nvim
ChatGPT Neovim Plugin: Effortless Natural Language Generation with OpenAI's ChatGPT API
paul-gauthier/aider
aider is AI pair programming in your terminal