fansinan

fansinan's Stars

CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:Python53.1k 941 1.1k8.8k
mozilla/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Language:Jupyter Notebook9.5k 185 5661.3k
espnet/espnet
End-to-End Speech Processing Toolkit
Language:Python8.6k 177 2.4k2.2k
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Language:Python7.9k 183 2911.9k
xinghaochen/awesome-hand-pose-estimation
Awesome work on hand pose estimation/tracking
Language:Python3.1k 182 38534
YCG09/chinese_ocr
CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras
Language:Python2.8k 90 3741.1k
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
1.6k 76 8228
google/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Language:Python1.6k 101 87320
timsainb/noisereduce
Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)
Language:Jupyter Notebook1.5k 23 78235
sigsep/open-unmix-pytorch
Open-Unmix - Music Source Separation for PyTorch
Language:Python1.3k 33 78195
yodaos-project/yodaos
Yet another Linux distribution for voice-enabled IoT and embrace Web standards
Language:C1.2k 77 12133
alumae/kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
Language:Python1.1k 68 222339
pykaldi/pykaldi
A Python wrapper for Kaldi
Language:Python1k 42 277247
DinoMan/speech-driven-animation
Language:Python953 56 70290
githubharald/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.
Language:Python819 25 23182
timctho/convolutional-pose-machines-tensorflow
Language:Python794 43 80270
kaituoxu/Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Language:Python775 30 40196
YoavRamon/awesome-kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
535 25 086
seanwood/gcc-nmf
Real-time GCC-NMF Blind Speech Separation and Enhancement
Language:Python317 12 16134
aishell-foundation/DaCiDian
DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)
Language:Python299 13 463
tiberiu44/TTS-Cube
End-2-end speech synthesis with recurrent neural networks
Language:Python225 20 2445
madebyollin/acapellabot
Acapella Extraction with a ConvNet
Language:Python205 18 1344
mxer/awesome-speech
this is a treasure-house of speech
164 14 058
jinserk/pytorch-asr
ASR with PyTorch
Language:Python140 9 720
open-speech/cn-text-normalizer
A python module that convert chinese written string to read string. 一个python包：将中文书面字符串转换为口语字符串。
Language:Python118 7 233
ZhengkunTian/Speech-Tranformer-Pytorch
Seq2Seq Speech Recognition with Transformer on Mandarin Chinese
Language:Python116 10 426
CynthiaSuwi/ASR-for-Chinese-Pipeline
Google Summer of Code 2018 Project: Automatic Speech Recognition for Speech-to-Text on Chinese
Language:Python115 7 429
tinyfool/webrtc-vad
Language:C34 7 18
yyj2013/webrtc_vad_for_mobile
This is a effective VAD(Voice Activity Detection) for iOS & Android. It is port from google webrtc.
Language:C10 4 05
errolyan/speech-aligner
speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Language:C++31