adamreis's Stars
meta-llama/llama
Inference code for Llama models
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
2noise/ChatTTS
A generative speech model for daily dialogue.
karpathy/llama2.c
Inference Llama 2 in one file of pure C
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Vaibhavs10/insanely-fast-whisper
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
joshpxyne/gpt-migrate
Easily migrate your codebase from one framework or language to another.
serge-chat/serge
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
guardrails-ai/guardrails
Adding guardrails to large language models.
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
pyutils/line_profiler
Line-by-line profiling for Python
Rikorose/DeepFilterNet
Noise supression using deep filtering
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
ufal/whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
eloialonso/diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
MeetKai/functionary
Chat language model that can use tools and interpret the results
acids-ircam/RAVE
Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder
srush/MiniChain
A tiny library for coding with large language models.
ddupont808/GPT-4V-Act
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
MDK8888/GPTFast
Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.
usefulsensors/useful-transformers
Efficient Inference of Transformer models
radi-cho/botbots
A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering various contexts and tasks (task-oriented dialogue systems, abstract reasoning, brainstorming).
maxbbraun/whisper-edge
OpenAI Whisper for edge devices
optskug/docs
Documentation/News/History on openpilot with Toyota/Lexus/Subaru with TSK/ECU SECURITY KEY