unilight

Assistant professor at Nagoya University, Japan.

Nagoya University, Nagoya, Japan

unilight's Stars

soumimaiti/speechlmscore_tool
Language:Python252
rinnakk/nue-asr
Nue-ASR inference code by rinna Co., Ltd.
Language:Python261
rinnakk/japanese-pretrained-models
Code for producing Japanese pretrained models provided by rinna Co., Ltd.
Language:Python57641
Takaaki-Saeki/DiscreteSpeechMetrics
Reference-aware automatic speech evaluation toolkit
Language:Python805
DigitalPhonetics/VoicePAT
VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.
Language:Shell464
Jungjee/RawNet
Official repository for RawNet, RawNet2, and RawNet3
Language:Python34956
MingjieChen/EasyVC
A toolkit for any-to-any encoder-decoder voice conversion systems
Language:Python809
Rongjiehuang/GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
Language:Python31044
TengyuDeng/lyrics-transcription-with-pitch-onset
Language:Python51
MoonInTheRiver/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Language: Python, 4.2k stars, 705 forks
nnsvs/nnsvs
Neural network-based singing voice synthesis library for research
Language:Python67081
facebookresearch/speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
Language:Python36555
facebookresearch/covost
CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)
Language:Python32845
ffxiong/uaspeech
Baseline kaldi script for UA-SPEECH corpus
Language:Shell293
sarulab-speech/UTMOS22
UT-Sarulab MOS prediction system using SSL models
Language:Python14714
UBC-NLP/L2ASR
Language:Python3
lmnt-com/wavegrad
A fast, high-quality neural vocoder.
Language:Python26746
tarepan/VoiceConversionLab
Collect Voice Conversion researches
Language:TypeScript887
facebookresearch/mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
Language: Python, 7k stars, 1.2k forks
dhimasryan/MOSA-Net-Cross-Domain
Language:Python449
nii-yamagishilab/mos-finetune-ssl
Language:Python6318
bshall/ZeroSpeech
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
Language:Python31547
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Language: Python, 18.9k stars, 2.9k forks
howard1337/S2VC
Language:Python9617
sky1456723/Pytorch-MBNet
A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK
Language:Python574
JeremyCCHsu/Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
Language:Cython713118
yistLin/FragmentVC
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
Language:Python19538
tzuhsien/Voice-conversion-evaluation
An evaluation toolkit for voice conversion models.
Language:Python395
liusongxiang/ppg-vc
PPG-Based Voice Conversion
Language:Python32173
joonson/syncnet_python
Out of time: automated lip sync in the wild
Language:Python618140
