Pinned Repositories
AutoVocoder
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
DL_special_topics
Lecture
LPC_Speech_Synthesis
Speech synthesis using LPC
lvc-vc
End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions
MB-iSTFT-VITS-with-AutoVocoder
Incorporating AutoVocoder into MB-iSTFT-VITS
SC-CNN
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
SC-VITS
VITS-based zero-shot TTS system with a variety of style/speaker conditioning methods.
SNAC
Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech
StyleSpeech
Unofficial PyTorch Implementation of StyleSpeech
TransferTTS
TransferTTS (Zero-Shot learning of VITS)
hcy71o's Repositories
hcy71o/TransferTTS
TransferTTS (Zero-Shot learning of VITS)
hcy71o/AutoVocoder
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
hcy71o/SNAC
Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech
hcy71o/MB-iSTFT-VITS-with-AutoVocoder
Incorporating AutoVocoder into MB-iSTFT-VITS
hcy71o/SC-CNN
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
hcy71o/SC-VITS
VITS-based zero-shot TTS system with a variety of style/speaker conditioning methods.
hcy71o/LPC_Speech_Synthesis
Speech synthesis using LPC
hcy71o/DL_special_topics
Lecture
hcy71o/StyleSpeech
Unofficial PyTorch Implementation of StyleSpeech
hcy71o/LAFMA
hcy71o/lvc-vc
End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions
hcy71o/AILTTS_demo
hcy71o/hcyspeech.github.io
Jekyll theme for building a personal site, blog, project documentation, or portfolio.
hcy71o/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
hcy71o/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
hcy71o/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
hcy71o/FlashSpeech
FlashSpeech: Efficient Zero-Shot Speech Synthesis
hcy71o/flux
Official inference repo for FLUX.1 models
hcy71o/hanja
Hangul and Hanja library
hcy71o/monotonic_align
Monotonic Alignment Search
hcy71o/NeXt_TDNN_ASV
Official repository of NeXt-TDNN for speaker verification
hcy71o/PeriodWave
The official implementation of PeriodWave and PeriodWave-Turbo
hcy71o/phonemizer
Simple text to phones converter for multiple languages
hcy71o/SC-CNN-demo
Demo page for SC-CNN
hcy71o/SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
hcy71o/SparseTTS-demo
Demo page for SparseTTS
hcy71o/ssp-features
For archiving source-filter parameters
hcy71o/starter_kit
Start building full stack dApps fast with this starter kit!
hcy71o/TEST
hcy71o/TFGAN
GAN-based vocoder focusing on time and frequency features respectively