XinleiNIU

Currently working on generative models on audio synthesis.

The Australian National UniversityCanberra, Australia

Pinned Repositories

AudioLDM2
Text-to-Audio/Music Generation
Language:Python2.3k 45 71179
MusicMagus
The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".
Language:Python30 3 61
sonyGanFork
This repo contains code for running a pytorch version of GANSynth.
Language:Jupyter Notebook8 4 20
FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Language:Python602 19 88111
demo-SoundLoCD
This is a demo for our paper 'SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation'
2 0 00
discreteVAE
Language:Python0 0 00
HybridVC-demo
This is a demo for our paper 'HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts'
Language:HTML0 1 00
LatentOptimalPathsBayesianDP
Implementation of Latent Optimal Path by Gumbel Propagation for Variational Bayesian Dynamic Programming
Language:Python4 3 10
SoundMorpher-demo
This is a demo for our paper 'SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model'
Language:HTML00
Toy_data_TTS
A toy dataset for TTS-duration alignment
Language:Python0 0 00

XinleiNIU/LatentOptimalPathsBayesianDP
Implementation of Latent Optimal Path by Gumbel Propagation for Variational Bayesian Dynamic Programming
Language:Python4 3 10
XinleiNIU/demo-SoundLoCD
This is a demo for our paper 'SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation'
2 0 00
XinleiNIU/discreteVAE
Language:Python0 0 00
XinleiNIU/HybridVC-demo
This is a demo for our paper 'HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts'
Language:HTML0 1 00
XinleiNIU/SoundMorpher-demo
This is a demo for our paper 'SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model'
Language:HTML00
XinleiNIU/Toy_data_TTS
A toy dataset for TTS-duration alignment
Language:Python0 0 00