Hertin's Stars
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
harlanhong/awesome-talking-head-generation
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
vincentherrmann/pytorch-wavenet
An implementation of WaveNet with fast generation
auspicious3000/contentvec
Self-supervised speech representations
r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
lingjzhu/CharsiuG2P
Multilingual G2P in 100 languages
jason9693/MusicTransformer-pytorch
Implementation of Music Transformer in PyTorch (ICLR 2019)
festvox/datasets-CMU_Wilderness
CMU Wilderness Multilingual Speech Dataset
auspicious3000/AutoPST
Global Rhythm Style Transfer Without Text Transcriptions
sarulab-speech/jsut-label
Context labels and pronunciation data for the JSUT corpus
auspicious3000/deepbeam
Deep-learning-based speech beamforming
r9y9/jsut-lab
HTS-style full-context labels for JSUT v1.1