XinleiNIU
Currently working on generative models on audio synthesis.
The Australian National UniversityCanberra, Australia
Pinned Repositories
AudioLDM2
Text-to-Audio/Music Generation
MusicMagus
The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".
sonyGanFork
This repo contains code for running a pytorch version of GANSynth.
FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
demo-SoundLoCD
This is a demo for our paper 'SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation'
discreteVAE
HybridVC-demo
This is a demo for our paper 'HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts'
LatentOptimalPathsBayesianDP
Implementation of Latent Optimal Path by Gumbel Propagation for Variational Bayesian Dynamic Programming
SoundMorpher-demo
This is a demo for our paper 'SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model'
Toy_data_TTS
A toy dataset for TTS-duration alignment
XinleiNIU's Repositories
XinleiNIU/LatentOptimalPathsBayesianDP
Implementation of Latent Optimal Path by Gumbel Propagation for Variational Bayesian Dynamic Programming
XinleiNIU/demo-SoundLoCD
This is a demo for our paper 'SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation'
XinleiNIU/discreteVAE
XinleiNIU/HybridVC-demo
This is a demo for our paper 'HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts'
XinleiNIU/SoundMorpher-demo
This is a demo for our paper 'SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model'
XinleiNIU/Toy_data_TTS
A toy dataset for TTS-duration alignment