WanCaiYan

Pinned Repositories

accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Language:Python0 0 00
ai-deployment
关注AI模型上线、模型部署
Language:Jupyter Notebook0 0 00
audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
Language:Python0 0 00
audio_diarization_annotation
Audio Diarization Annotation tool
Language:JavaScript0 0 00
auorange
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
Language:Python0 0 00
AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
Language:Python0 0 00
awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
0 0 00
BVAE-TTS
Official implementation of BVAE-TTS
Language:Python0 0 00
chatbot-list
行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍
0 0 00
espnet_tts_frontend
Text frontend for ESPnet tts recipes
Language:Python1 0 00

WanCaiYan's Repositories

WanCaiYan/accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Language:Python0 0 00
WanCaiYan/audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
Language:Python0 0 00
WanCaiYan/auorange
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
Language:Python0 0 00
WanCaiYan/AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
Language:Python0 0 00
WanCaiYan/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
0 0 00
WanCaiYan/BVAE-TTS
Official implementation of BVAE-TTS
Language:Python0 0 00
WanCaiYan/chatbot-list
行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍
0 0 00
WanCaiYan/clean-text
🧹 Python package for text cleaning
Language:Python0 0 00
WanCaiYan/deep-learning-model-convertor
The convertor/conversion of deep learning models for different deep learning frameworks/softwares.
0 0
WanCaiYan/free-programming-books
:books: Freely available programming books
0 0
WanCaiYan/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Language:Python0 0
WanCaiYan/interesting-python
有趣的Python爬虫和Python数据分析小项目(Some interesting Python crawlers and data analysis projects)
Language:Jupyter Notebook0 0
WanCaiYan/INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
0 0
WanCaiYan/KAN-TTS
Language:Python0 0
WanCaiYan/knn-vc
Voice Conversion With Just Nearest Neighbors
Language:Python0 0
WanCaiYan/LVCNet
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
Language:Python0 0
WanCaiYan/mfa-models
Collection of pretrained models for the Montreal Forced Aligner
Language:Python0 0
WanCaiYan/PSST
Prosodic Speech Segmentation with Transformers
Language:Python0 0
WanCaiYan/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
Language:Python0 0
WanCaiYan/SMART-NAR_Fast_TTS
Language:Python0 0
WanCaiYan/SpeechAlgorithms
Speech Algorithms
Language:C0 0
WanCaiYan/survey
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
0 0
WanCaiYan/TTS_TFLite
This repository is a collection of TTS Models in TFLite
Language:Jupyter Notebook0 0
WanCaiYan/VAENAR-TTS
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
Language:Python0 0
WanCaiYan/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
Language:Python0 0
WanCaiYan/visqol
Perceptual Quality Estimator for speech and audio
Language:C++0 0
WanCaiYan/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python0 0
WanCaiYan/voice-filter
A unofficial Pytorch implementation of Google's VoiceFilter
Language:Python0 0
WanCaiYan/WavJourney
WavJourney: Compositional Audio Creation with LLMs
0 0
WanCaiYan/zhtts
A demo of zh/Chinese Text to Speech system run on CPU in real time.
Language:Python0 0