yangdongchao

A PhD Student from The Chinese University of Hong Kong, currently working on multi-modal audio foundation models and Chinese traditional philosophy.

CUHK&PKU&SHU

Pinned Repositories

AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Language:Python10.1k 137 51868
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Language:Python605 31 4080
InstructTTS
The deme page of InstructTTS
155 13 28
LLM-Codec
The open source code for LLM-Codec
Language:Python118 13 55
RSTnet
Real-time Speech-Text Foundation Model Toolkit (wip)
Language:Python126 11 411
SimpleSpeech
The open source code for SimpleSpeech series
Language:Python120 8 76
SoundStorm
The reproduced code for Google's SoundStorm
Language:Python261 20 2719
Text-to-sound-Synthesis
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
Language:Python352 17 2733
text-to-sound-synthesis-demo
This is a demo webpage for our paper 'text-to-sound synthesis'
126 2 06
UniAudio
The Open Source Code of UniAudio
Language:Python536 36 3332

yangdongchao's Repositories

yangdongchao/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Language:Python605 31 4080
yangdongchao/UniAudio
The Open Source Code of UniAudio
Language:Python536 36 3332
yangdongchao/Text-to-sound-Synthesis
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
Language:Python352 17 2733
yangdongchao/SoundStorm
The reproduced code for Google's SoundStorm
Language:Python261 20 2719
yangdongchao/InstructTTS
The deme page of InstructTTS
155 13 28
yangdongchao/RSTnet
Real-time Speech-Text Foundation Model Toolkit (wip)
Language:Python126 11 411
yangdongchao/text-to-sound-synthesis-demo
This is a demo webpage for our paper 'text-to-sound synthesis'
126 2 06
yangdongchao/SimpleSpeech
The open source code for SimpleSpeech series
Language:Python120 8 76
yangdongchao/LLM-Codec
The open source code for LLM-Codec
Language:Python118 13 55
yangdongchao/UniAudio_demo
The demo page of UniAudio
34 3 04
yangdongchao/SimpleSpeech2_demo
Language:Python6 3 0
yangdongchao/NoreSpeech
The source code of NoreSpeech
4 5 10
yangdongchao/weakly-target-sound-detection
A Two-student Learning Framework for Mixed Supervised Target Sound Detection
Language:Python4 2 00
yangdongchao/RaDur
The source code of RaDur
Language:Python3 2 0
yangdongchao/Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
Language:Python1 0 0
yangdongchao/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language:Python1 0
yangdongchao/beautiful-jekyll
✨ Build a beautiful and simple website in literally minutes. Demo at https://beautifuljekyll.com
Language:HTML1 0
yangdongchao/ChatGPT
OpenAI API Free Reverse Proxy
Language:TypeScript0 0
yangdongchao/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Language:Python1 0
yangdongchao/EdVAE
Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"
Language:Python0 0
yangdongchao/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Language:Python1 0
yangdongchao/Make-An-Audio
Language:Python0 0
yangdongchao/NoreSpeech_demo
2 01
yangdongchao/opengpts
Language:Rich Text Format0 0
yangdongchao/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Jupyter Notebook0 0
yangdongchao/TranSpeech
PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation
Language:Python1 0
yangdongchao/tts-qa
Language:Python0 0
yangdongchao/vit-vqgan-jax
Jax implementation of VIT-VQGAN
Language:Python0 0
yangdongchao/yangdongchao
Config files for my GitHub profile.
2 0
yangdongchao/yangdongchao.github.io
Personal Homepage
Language:JavaScript2 0