waywardspooky

waywardspooky's Stars

2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python31.2k 180 5183.4k
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python11.7k 134 6881.2k
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python8.6k 133 1.1k1.4k
Vaibhavs10/insanely-fast-whisper
Language:Jupyter Notebook7.5k 65 189529
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python6.7k 55 2051.2k
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6k 71 989757
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python4.5k 58 152384
huggingface/parler-tts
Inference and training library for high-quality TTS models.
Language:Python4.3k 55 97430
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python4.1k 51 229402
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
Language:Python3.8k 78 125652
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Language:Jupyter Notebook3.4k 45 170286
buaacyw/MeshAnything
From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
Language:Python2k 30 2684
kadirnar/whisper-plus
WhisperPlus: Faster, Smarter, and More Capable 🚀
Language:Python1.7k 19 50137
xenova/whisper-web
ML-powered speech recognition directly in your browser
Language:TypeScript1.7k 13 32190
zou-group/textgrad
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
Language:Python1.6k 22 70130
sc0ty/subsync
Subtitle Speech Synchronizer
Language:C++1.3k 24 17853
abdeladim-s/subsai
🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️
Language:Python1.3k 14 96104
pydn/ComfyUI-to-Python-Extension
A powerful tool that translates ComfyUI workflows into executable Python code.
Language:Python1.1k 7 55113
erew123/alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
Language:HTML931 18 229106
transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
Language:Python705 7 1537
McCloudS/subgen
Autogenerate subtitles using OpenAI Whisper Model via Jellyfin, Plex, Emby, Tautulli, or Bazarr
Language:Python562 6 7648
MasayaKawamura/MB-iSTFT-VITS
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
Language:Python417 17 2664
daswer123/xtts-api-server
A simple FastAPI Server to run XTTSv2
Language:Python369 7 7285
YuanGongND/whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
Language:Python315 10 3125
tomchang25/whisper-auto-transcribe
Auto transcribe tool based on whisper
Language:Python215 6 4814
Vaibhavs10/optimise-my-whisper
Language:Jupyter Notebook180 7 117
matatonic/openedai-vision
An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.
Language:Python172 5 1214
Vali-98/XTTS-RVC-UI
A Gradio UI for XTTSv2 and RVC.
Language:Python131 4 1949
metavoiceio/MetaVoiceLive
Language:JavaScript67 6 59
deepestcyber/vmse2000-detector
Language:Python31