Pinned Repositories
3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
aichallenge
xunfei dialect baseline
Audio-Effects
Collection of audio effects plugins implemented from the explanations in the book "Audio Effects: Theory, Implementation and Application" by Joshua D. Reiss and Andrew P. McPherson.
clari_wavenet_vocoder
cppjieba
"结巴"中文分词的C++版本
darknet
Convolutional Neural Networks
dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
dctts-pytorch
The pytorch implementation of DC-TTS
deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System https://arxiv.org/pdf/1705.02304.pdf
SpeechSynthesis
语音合成综述
lbqin's Repositories
lbqin/speech-vad-demo
集成Webrtc的VAD,用于切分音频文件
lbqin/SpeechSynthesis
语音合成综述
lbqin/aichallenge
xunfei dialect baseline
lbqin/clari_wavenet_vocoder
lbqin/cppjieba
"结巴"中文分词的C++版本
lbqin/darknet
Convolutional Neural Networks
lbqin/deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System https://arxiv.org/pdf/1705.02304.pdf
lbqin/GCommandsPytorch
ConvNets for Audio Recognition using Google Commands Dataset
lbqin/kaldi
This is now the official location of the Kaldi project.
lbqin/kaldi-enhan
Tools for speech enhancement based on kaldi
lbqin/LPCNet
Efficient neural speech synthesis
lbqin/mace
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
lbqin/marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
lbqin/merlin
This is now the official location of the Merlin project.
lbqin/ML-KWS-for-MCU
Keyword spotting on Arm Cortex-M Microcontrollers
lbqin/MMdnn
MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.
lbqin/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
lbqin/MTTS
A Demo of Mandarin/Chinese TTS frontend
lbqin/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
lbqin/parallel_wavenet_vocoder
Parallel WaveNet Vocoder Based on ClariNet
lbqin/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding
lbqin/Sinsy-Remix
The HMM-Based Singing Voice Syntheis System Remix "Sinsy-r"
lbqin/speech_commands
lbqin/tacotron-1
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model
lbqin/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
lbqin/THULAC
An Efficient Lexical Analyzer for Chinese
lbqin/TTS
Deep learning for Text2Speech
lbqin/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
lbqin/web-speech-api
A repository for demos illustrating features of the Web Speech API. See https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API for more details.
lbqin/World
A high-quality speech analysis, manipulation and synthesis system