Pinned Repositories
albert_zh
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS; large-scale Chinese pre-trained ALBERT models
Alibaba-MIT-Speech
Alibaba speech technology
alignhelper
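Alignment helper for generating audio-text pairs. Produces speech synthesis corpora with aligned audio and text; after manual verification, the corpora can be used to train speech synthesis models.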
AntiFraudChatBot
A simple prompt-chatting AI based on wechaty and a fine-tuned NLP model
ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
audio-annotator
Audio annotation tool
audio-annotator-1
A JavaScript interface for annotating and labeling audio files.
aukit
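Audio toolkit. An easy-to-use speech processing toolbox with modules for speech denoising, audio format conversion, spectrogram feature generation, and more.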
tacotron2
Forked from https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/SpeechSynthesis/Tacotron2 and merged with https://github.com/Rayhane-mamah/Tacotron-2
TTS
Deep learning for Text to Speech
SDlibowen's Repositories
SDlibowen/AntiFraudChatBot
A simple prompt-chatting AI based on wechaty and a fine-tuned NLP model
SDlibowen/Coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
SDlibowen/ddsp
DDSP: Differentiable Digital Signal Processing
SDlibowen/ddsp-singing-vocoders
Official implementation of SawSing (ISMIR'22)
SDlibowen/DDSP-SVC
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
SDlibowen/diff-svc
Singing Voice Conversion via diffusion model
SDlibowen/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Forked and maintained by the OpenVPI community
SDlibowen/Diffusion-SVC
SDlibowen/fish-diffusion
An easy to understand TTS / SVS / SVC framework
SDlibowen/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
SDlibowen/GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
SDlibowen/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
SDlibowen/lora-svc
Singing voice conversion based on Whisper, with LoRA for singing voice cloning
SDlibowen/megatts2
Unofficial implementation of Megatts2
SDlibowen/midieditor
Provides an interface to edit, record, and play Midi data
SDlibowen/MioTTS
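Integrated UI software that reimplements Tacotron2 inference in C++ with OnnxRuntime and implements single-character and multi-character VITS model inference with Libtorch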
SDlibowen/MoeGoe
Executable file for VITS inference
SDlibowen/MoeTTS
Speech synthesis model and inference GUI repo for galgame characters, based on Tacotron2, Hifigan and VITS
SDlibowen/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
SDlibowen/OpenUtau
OpenUTAU renderer for diffsinger. Usage instructions (in Chinese): https://github.com/xunmengshe/OpenUtau/wiki/%E4%BD%BF%E7%94%A8%E6%96%B9%E6%B3%95%EF%BC%88%E4%B8%AD%E6%96%87%EF%BC%89
SDlibowen/pc-ddsp
Pitch Controllable DDSP Vocoders
SDlibowen/pc-ddsp-1
Pitch Controllable DDSP Vocoders
SDlibowen/qsynthesis-revenge
Cross-platform SVS frontend
SDlibowen/RMVPE
SDlibowen/so-vits-svc
Singing voice timbre conversion model based on VITS and SoftVC
SDlibowen/so-vits-svc-1
SoftVC VITS Singing Voice Conversion
SDlibowen/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
SDlibowen/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
SDlibowen/vits
VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai
SDlibowen/VST_NetProcess-
async http process VST plugin