gray4what's Stars
liuzhao1225/YouDub-webui
nidhaloff/deep-translator
A flexible free and unlimited python tool to translate between different languages in a simple way using multiple translators.
gaborvecsei/whisper-live-transcription
Live-Transcription (STT) with Whisper PoC
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
k2-fsa/next-gen-kaldi-wechat
PKUFlyingPig/cs-self-learning
计算机自学指南
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
NVIDIA/NeMo-text-processing
NeMo text processing for ASR and TTS
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
zylon-ai/private-gpt
Interact with your documents using the power of GPT, 100% privately, no data leaks
huiscliu/Tutorials
Parallel programming tutorials
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Anjok07/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
GrowingGit/GitHub-Chinese-Top-Charts
:cn: GitHub中文排行榜,各语言分设「软件 | 资料」榜单,精准定位中文好项目。各取所需,高效学习。
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
ggerganov/llama.cpp
LLM inference in C/C++
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
WZMIAOMIAO/deep-learning-for-image-processing
deep learning for image processing including classification and object-detection etc.
athena-team/athena-decoder
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
pettarin/forced-alignment-tools
A collection of links and notes on forced alignment tools
jegesh/python-sqs-listener
A simple wrapper for boto3 for listening, and sending, to an AWS SQS queue
majianjia/nnom
A higher-level Neural Network library for microcontrollers.
datawhalechina/thorough-pytorch
PyTorch入门教程,在线阅读地址:https://datawhalechina.github.io/thorough-pytorch/
daanzu/kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
yeyupiaoling/PPASR
基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型
jackyyy0228/WFST-decoder-for-phoneme-posterior