fansinan's Stars
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
mozilla/TTS
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
espnet/espnet
End-to-End Speech Processing Toolkit
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
xinghaochen/awesome-hand-pose-estimation
Awesome work on hand pose estimation/tracking
YCG09/chinese_ocr
CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using TensorFlow and Keras
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
google/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
timsainb/noisereduce
Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)
sigsep/open-unmix-pytorch
Open-Unmix - Music Source Separation for PyTorch
yodaos-project/yodaos
Yet another Linux distribution for voice-enabled IoT that embraces Web standards
alumae/kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.
pykaldi/pykaldi
A Python wrapper for Kaldi
DinoMan/speech-driven-animation
githubharald/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.
timctho/convolutional-pose-machines-tensorflow
kaituoxu/Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
YoavRamon/awesome-kaldi
This is a list of features, scripts, blogs, and resources for making better use of Kaldi (http://kaldi-asr.org/)
seanwood/gcc-nmf
Real-time GCC-NMF Blind Speech Separation and Enhancement
aishell-foundation/DaCiDian
DaCiDian is an open-source Mandarin Chinese lexicon for automatic speech recognition (ASR)
tiberiu44/TTS-Cube
End-to-end speech synthesis with recurrent neural networks
madebyollin/acapellabot
Acapella Extraction with a ConvNet
mxer/awesome-speech
A treasure house of speech resources
jinserk/pytorch-asr
ASR with PyTorch
open-speech/cn-text-normalizer
A Python module that converts written Chinese strings into their spoken (read-aloud) form.
ZhengkunTian/Speech-Tranformer-Pytorch
Seq2Seq Speech Recognition with Transformer on Mandarin Chinese
CynthiaSuwi/ASR-for-Chinese-Pipeline
Google Summer of Code 2018 Project: Automatic Speech Recognition for Speech-to-Text on Chinese
tinyfool/webrtc-vad
yyj2013/webrtc_vad_for_mobile
An effective VAD (Voice Activity Detection) for iOS and Android, ported from Google WebRTC.
errolyan/speech-aligner
speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.