LiuMingYy

LiuMingYy's Stars

CyC2018/CS-Notes
:books: 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计
179k 5.3k 59651.2k
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language:Python26.3k 180 1304.9k
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
Language:Python8.2k 49 01.2k
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Language:Python7.8k 82 154770
NVIDIA/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language:Jupyter Notebook5.1k 117 5621.4k
Plachtaa/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Language:Python4.8k 41 572722
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language:Python3.6k 59 71313
PlayVoice/whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
Language:Python2.7k 29 166922
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python1.9k 28 221547
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Language:Python1.3k 55 31104
bootphon/phonemizer
Simple text to phones converter for multiple languages
Language:Python1.3k 23 156176
auspicious3000/autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Language:Python1k 30 113210
wenet-e2e/speech-synthesis-paper
List of speech synthesis papers.
1k 61 4121
yeyupiaoling/MASR
Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2模型，支持多种数据增强方法。
Language:Python630 12 71109
OlaWod/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Language:Python616 19 88112
p0p4k/vits2_pytorch
unofficial vits2-TTS implementation in pytorch
Language:Python504 25 5895
heatz123/naturalspeech
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
Language:Python472 30 3468
modelscope/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Language:Python380 16 5231
wesbz/SoundStream
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
Language:Python367 10 1653
wenet-e2e/speech-recognition-papers
Towards hot directions in industrial end to end speech recognition
325 19 240
kaituoxu/Listen-Attend-Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
Language:Python200 6 1856
glory20h/VoiceLDM
VoiceLDM: Text-to-Speech with Environmental Context
Language:Python166 7 58
ConsistencyVC/ConsistencyVC-voive-conversion
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
Language:Python140 9 2722
thu-ml/Bridge-TTS
Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).
126 41 41
XuelianCheng/SLT-Net
Implicit Motion Handling for Video Camouflaged Object Detection (CVPR 2022)
Language:Python62 3 1113
ChunmingHe/WS-SAM
Language:Python45 3 61
double22a/asr_nlp_paper_code
Papers of ASR, Tools of ASR
39 1 19
Ash-one/ch_vits
语音合成端到端TTS模型vits中文版，VITS Mandarin
Language:Python15 0 02
cnlinxi/blog
personal blog
Language:HTML14 1 03
qwen-audio/Qwen-Audio
Language:JavaScript3 1 0