Pinned Repositories
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
ai-deployment
关注AI模型上线、模型部署
audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
audio_diarization_annotation
Audio Diarization Annotation tool
auorange
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
BVAE-TTS
Official implementation of BVAE-TTS
chatbot-list
行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍
espnet_tts_frontend
Text frontend for ESPnet tts recipes
WanCaiYan's Repositories
WanCaiYan/accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
WanCaiYan/audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
WanCaiYan/auorange
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
WanCaiYan/AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
WanCaiYan/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
WanCaiYan/BVAE-TTS
Official implementation of BVAE-TTS
WanCaiYan/chatbot-list
行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍
WanCaiYan/clean-text
🧹 Python package for text cleaning
WanCaiYan/deep-learning-model-convertor
The convertor/conversion of deep learning models for different deep learning frameworks/softwares.
WanCaiYan/free-programming-books
:books: Freely available programming books
WanCaiYan/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
WanCaiYan/interesting-python
有趣的Python爬虫和Python数据分析小项目(Some interesting Python crawlers and data analysis projects)
WanCaiYan/INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
WanCaiYan/KAN-TTS
WanCaiYan/knn-vc
Voice Conversion With Just Nearest Neighbors
WanCaiYan/LVCNet
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
WanCaiYan/mfa-models
Collection of pretrained models for the Montreal Forced Aligner
WanCaiYan/PSST
Prosodic Speech Segmentation with Transformers
WanCaiYan/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
WanCaiYan/SMART-NAR_Fast_TTS
WanCaiYan/SpeechAlgorithms
Speech Algorithms
WanCaiYan/survey
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
WanCaiYan/TTS_TFLite
This repository is a collection of TTS Models in TFLite
WanCaiYan/VAENAR-TTS
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
WanCaiYan/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
WanCaiYan/visqol
Perceptual Quality Estimator for speech and audio
WanCaiYan/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
WanCaiYan/voice-filter
A unofficial Pytorch implementation of Google's VoiceFilter
WanCaiYan/WavJourney
WavJourney: Compositional Audio Creation with LLMs
WanCaiYan/zhtts
A demo of zh/Chinese Text to Speech system run on CPU in real time.