gray4what

gray4what's Stars

liuzhao1225/YouDub-webui
Language:Python1.9k199
nidhaloff/deep-translator
A flexible free and unlimited python tool to translate between different languages in a simple way using multiple translators.
Language:Python1.6k178
gaborvecsei/whisper-live-transcription
Live-Transcription (STT) with Whisper PoC
Language:Python14321
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
Language:Python1.9k257
k2-fsa/next-gen-kaldi-wechat
3210
PKUFlyingPig/cs-self-learning
计算机自学指南
Language:HTML56.7k6.8k
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python54.9k5.7k
NVIDIA/NeMo-text-processing
NeMo text processing for ASR and TTS
Language:Python27188
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language:Python25.6k4.8k
zylon-ai/private-gpt
Interact with your documents using the power of GPT, 100% privately, no data leaks
Language:Python53.9k7.2k
huiscliu/Tutorials
Parallel programming tutorials
Language:C601205
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Language:Python11.9k994
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python6.8k1.2k
Anjok07/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
Language:Python17.8k1.3k
GrowingGit/GitHub-Chinese-Top-Charts
:cn: GitHub中文排行榜，各语言分设「软件 | 资料」榜单，精准定位中文好项目。各取所需，高效学习。
Language:Java99.7k13.1k
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Language:Python10.5k2.2k
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++66.2k9.5k
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language:Jupyter Notebook13.1k1.8k
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language:C35k3.6k
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language:Python11k1.8k
WZMIAOMIAO/deep-learning-for-image-processing
deep learning for image processing including classification and object-detection etc.
Language:Python22.7k7.9k
athena-team/athena-decoder
Language:Python7526
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python69.4k8.2k
pettarin/forced-alignment-tools
A collection of links and notes on forced alignment tools
Language:Python86886
jegesh/python-sqs-listener
A simple wrapper for boto3 for listening, and sending, to an AWS SQS queue
Language:Python15471
majianjia/nnom
A higher-level Neural Network library for microcontrollers.
Language:C916245
datawhalechina/thorough-pytorch
PyTorch入门教程，在线阅读地址：https://datawhalechina.github.io/thorough-pytorch/
Language:Jupyter Notebook2.5k414
daanzu/kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Language:Python33750
yeyupiaoling/PPASR
基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型
Language:Python808129
jackyyy0228/WFST-decoder-for-phoneme-posterior
Language:Shell227