gmltmd789
Ph.D. Candidate at Seoul National University, Republic of Korea. Interested in Spoken Language Model, Speech Synthesis, and Generative Model
Seoul National UniversitySeoul, Republic of Korea
Pinned Repositories
speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
gmltmd789.github.io
UnitSpeech
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"
AnyGPT
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
LibriSQA
gmltmd789's Repositories
gmltmd789/UnitSpeech
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"
gmltmd789/gmltmd789.github.io