Pinned Repositories
Multi-modal
cnceleb_data_collector
CN-Celeb3_collector
gitstudy
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
smile-struggler.github.io
smilestruggler.github.io
SpecVQGAN
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
stable-diffusion
A latent text-to-image diffusion model
vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
smile-struggler's Repositories
smile-struggler/CN-Celeb3_collector
smile-struggler/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
smile-struggler/gitstudy
smile-struggler/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
smile-struggler/smile-struggler.github.io
smile-struggler/smilestruggler.github.io
smile-struggler/SpecVQGAN
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
smile-struggler/stable-diffusion
A latent text-to-image diffusion model