unilight's Stars
soumimaiti/speechlmscore_tool
rinnakk/nue-asr
Nue-ASR inference code by rinna Co., Ltd.
rinnakk/japanese-pretrained-models
Code for producing Japanese pretrained models provided by rinna Co., Ltd.
Takaaki-Saeki/DiscreteSpeechMetrics
Reference-aware automatic speech evaluation toolkit
DigitalPhonetics/VoicePAT
VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.
Jungjee/RawNet
Official repository for RawNet, RawNet2, and RawNet3
MingjieChen/EasyVC
A toolkit for any-to-any encoder-decoder voice conversion systems
Rongjiehuang/GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
TengyuDeng/lyrics-transcription-with-pitch-onset
MoonInTheRiver/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
nnsvs/nnsvs
Neural network-based singing voice synthesis library for research
facebookresearch/speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
facebookresearch/covost
CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)
ffxiong/uaspeech
Baseline kaldi script for UA-SPEECH corpus
sarulab-speech/UTMOS22
UT-Sarulab MOS prediction system using SSL models
UBC-NLP/L2ASR
lmnt-com/wavegrad
A fast, high-quality neural vocoder.
tarepan/VoiceConversionLab
Collect Voice Conversion researches
facebookresearch/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
dhimasryan/MOSA-Net-Cross-Domain
nii-yamagishilab/mos-finetune-ssl
bshall/ZeroSpeech
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
howard1337/S2VC
sky1456723/Pytorch-MBNet
A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK
JeremyCCHsu/Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
yistLin/FragmentVC
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
tzuhsien/Voice-conversion-evaluation
An evaluation toolkit for voice conversion models.
liusongxiang/ppg-vc
PPG-Based Voice Conversion
joonson/syncnet_python
Out of time: automated lip sync in the wild