yangdongchao
A PhD student at The Chinese University of Hong Kong, currently working on multi-modal audio foundation models and traditional Chinese philosophy.
CUHK&PKU&SHU
Pinned Repositories
AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
InstructTTS
The demo page of InstructTTS
LLM-Codec
The open source code for LLM-Codec
RSTnet
Real-time Speech-Text Foundation Model Toolkit (wip)
SimpleSpeech
The open source code for the SimpleSpeech series
SoundStorm
The reproduced code for Google's SoundStorm
Text-to-sound-Synthesis
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
text-to-sound-synthesis-demo
This is a demo webpage for our paper 'text-to-sound synthesis'
UniAudio
The Open Source Code of UniAudio
yangdongchao's Repositories
yangdongchao/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
yangdongchao/UniAudio
The Open Source Code of UniAudio
yangdongchao/Text-to-sound-Synthesis
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
yangdongchao/SoundStorm
The reproduced code for Google's SoundStorm
yangdongchao/InstructTTS
The demo page of InstructTTS
yangdongchao/RSTnet
Real-time Speech-Text Foundation Model Toolkit (wip)
yangdongchao/text-to-sound-synthesis-demo
This is a demo webpage for our paper 'text-to-sound synthesis'
yangdongchao/SimpleSpeech
The open source code for the SimpleSpeech series
yangdongchao/LLM-Codec
The open source code for LLM-Codec
yangdongchao/UniAudio_demo
The demo page of UniAudio
yangdongchao/SimpleSpeech2_demo
yangdongchao/NoreSpeech
The source code of NoreSpeech
yangdongchao/weakly-target-sound-detection
A Two-student Learning Framework for Mixed Supervised Target Sound Detection
yangdongchao/RaDur
The source code of RaDur
yangdongchao/Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
yangdongchao/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
yangdongchao/beautiful-jekyll
✨ Build a beautiful and simple website in literally minutes. Demo at https://beautifuljekyll.com
yangdongchao/ChatGPT
OpenAI API Free Reverse Proxy
yangdongchao/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
yangdongchao/EdVAE
Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"
yangdongchao/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
yangdongchao/Make-An-Audio
yangdongchao/NoreSpeech_demo
yangdongchao/opengpts
yangdongchao/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
yangdongchao/TranSpeech
PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation
yangdongchao/tts-qa
yangdongchao/vit-vqgan-jax
Jax implementation of VIT-VQGAN
yangdongchao/yangdongchao
Config files for my GitHub profile.
yangdongchao/yangdongchao.github.io
Personal Homepage