sarulab-speech

UTokyo-SaruLab Speech Research Group at The University of Tokyo, Japan.

Tokyo, Japan

Pinned Repositories

Coco-Nut
Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus
21 1 00
jsut-label
context labels and pronunciation data for JSUT corpus
67 5 19
jtubespeech
Language:Python213 10 846
lightweight_spkr_anon
Lightweight speaker anonymization [IEEE SLT2021]
Language:Python26 5 011
multi-speaker-dgp
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
Language:Python24 4 02
tdmelodic_openjtalk
tdmelodic for open-jtalk
22 3 11
UTMOS22
UT-Sarulab MOS prediction system using SSL models
Language:Python188 7 1114
UTMOSv2
UTokyo-SaruLab MOS Prediction System
Language:Python97 5 58
whisper-asr-finetune
Language:Python32 4 58
xvector_jtubespeech
xvector model on jtubespeech
Language:Python40 4 24

sarulab-speech's Repositories

sarulab-speech/jtubespeech
Language:Python213 10 846
sarulab-speech/UTMOS22
UT-Sarulab MOS prediction system using SSL models
Language:Python188 7 1114
sarulab-speech/UTMOSv2
UTokyo-SaruLab MOS Prediction System
Language:Python97 5 58
sarulab-speech/jsut-label
context labels and pronunciation data for JSUT corpus
67 5 19
sarulab-speech/xvector_jtubespeech
xvector model on jtubespeech
Language:Python40 4 24
sarulab-speech/whisper-asr-finetune
Language:Python32 4 58
sarulab-speech/lightweight_spkr_anon
Lightweight speaker anonymization [IEEE SLT2021]
Language:Python26 5 011
sarulab-speech/multi-speaker-dgp
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
Language:Python24 4 02
sarulab-speech/tdmelodic_openjtalk
tdmelodic for open-jtalk
22 3 11
sarulab-speech/Coco-Nut
Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus
21 1 00
sarulab-speech/spatial_voice_conversion
Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals
Language:Python14 2 11
sarulab-speech/ml-audiocaps
Multi-lingual AudioCaps
7 2 00
sarulab-speech/VMC2024-sarulab-data
7 2 0
sarulab-speech/SaSLaW
Dialogue Speech Corpus with Audio-visual Egocentric Information, "So, what are you Speaking, Listening, and Watching?"
Language:Python5 2 00
sarulab-speech/visual-onoma-to-wave
Visual onoma-to-wave official implementation
Language:Python5 1 00
sarulab-speech/Mid-Attribute-Speaker-Generation
Language:Python2 1 01
sarulab-speech/pseudo_speech_decryption
Language:Python1 3 00
sarulab-speech/demo_CALLS_corpus
CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center (INTERSPEECH2023)
Language:HTML0 1 00
sarulab-speech/demo_ChatGPT_EDSS
ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings (INTERSPEECH2023)
Language:HTML0 1 00
sarulab-speech/bert-japanese
BERT models for Japanese text.
Language:Python1 0
sarulab-speech/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python0 0
sarulab-speech/yodas-transcription
Modified transcriptions of YODAS dataset
1 0

sarulab-speech

Pinned Repositories

Coco-Nut

jsut-label

jtubespeech

lightweight_spkr_anon

multi-speaker-dgp

tdmelodic_openjtalk

UTMOS22

UTMOSv2

whisper-asr-finetune

xvector_jtubespeech

sarulab-speech's Repositories

sarulab-speech/jtubespeech

sarulab-speech/UTMOS22

sarulab-speech/UTMOSv2

sarulab-speech/jsut-label

sarulab-speech/xvector_jtubespeech

sarulab-speech/whisper-asr-finetune

sarulab-speech/lightweight_spkr_anon

sarulab-speech/multi-speaker-dgp

sarulab-speech/tdmelodic_openjtalk

sarulab-speech/Coco-Nut

sarulab-speech/spatial_voice_conversion

sarulab-speech/ml-audiocaps

sarulab-speech/VMC2024-sarulab-data

sarulab-speech/SaSLaW

sarulab-speech/visual-onoma-to-wave

sarulab-speech/Mid-Attribute-Speaker-Generation

sarulab-speech/pseudo_speech_decryption

sarulab-speech/demo_CALLS_corpus

sarulab-speech/demo_ChatGPT_EDSS

sarulab-speech/bert-japanese

sarulab-speech/fairseq

sarulab-speech/yodas-transcription