Rongjiehuang
Focusing on multimodal synthesis (speech/audio/sing), speech translation, and self-supervised learning.
Facebook AI Research (FAIR)
Pinned Repositories
AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
awesome-speech-to-speech-translation
List of direct speech-to-speech translation papers.
FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
Multi-Singer
PyTorch Implementation of Multi-Singer (ACM-MM'21)
ProDiff
PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline
TranSpeech
PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation
Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Rongjiehuang's Repositories
Rongjiehuang/ProDiff
PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline
Rongjiehuang/FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
Rongjiehuang/GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
Rongjiehuang/TranSpeech
PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation
Rongjiehuang/Multi-Singer
PyTorch Implementation of Multi-Singer (ACM-MM'21)
Rongjiehuang/awesome-speech-to-speech-translation
List of direct speech-to-speech translation papers.
Rongjiehuang/Multiband-WaveRNN
An unofficial implement of autoregressive vocoder Multiband-WaveRNN. Audio samples in https://rongjiehuang.github.io/Multiband-WaveRNN/
Rongjiehuang/SingGAN
Project page for SingGAN (ACM-MM' 2022): Generative Adversarial Network For High-Fidelity Singing Voice Generation
Rongjiehuang/WaterCo
基于Mask R-CNN的水下垃圾检测
Rongjiehuang/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Rongjiehuang/AudioGPT
Rongjiehuang/Rongjiehuang
Rongjiehuang/LeetCodeAnimation
Demonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)
Rongjiehuang/UniAudio
The Open Source Code of UniAudio
Rongjiehuang/2019_algorithm_intern_information
2020年的算法实习岗位/校招公司信息表,部分包括内推码,和常见深度学习算法岗面试题及答案,暑期计算机视觉实习面经和总结
Rongjiehuang/955.WLB
955 不加班的公司名单 - 工作 955,work–life balance (工作与生活的平衡)
Rongjiehuang/996.ICU
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
Rongjiehuang/awesome-courses
:books: List of awesome university courses for learning Computer Science!
Rongjiehuang/daily-paper-computer-vision
记录每天整理的计算机视觉/深度学习/机器学习相关方向的论文
Rongjiehuang/Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
Rongjiehuang/leetcode
LeetCode题解,151道题完整版
Rongjiehuang/PaddleSpeech
An Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.
Rongjiehuang/pumpkin-book
《机器学习》(西瓜书)公式推导解析,在线阅读地址:https://datawhalechina.github.io/pumpkin-book
Rongjiehuang/rongjiehuang.github.io
Personal Homepage
Rongjiehuang/tips_for_interview
我的一些面试心得;自学CS历程分享;找工作经验分享
Rongjiehuang/wait_rongjiehuang.github.io
A beautiful, simple, clean, and responsive Jekyll theme for academics
Rongjiehuang/zju-icicles
浙江大学课程攻略共享计划
Rongjiehuang/Awesome-algorithm-interview
算法工程师(人工智能CV方向)面试问题及相关资料
Rongjiehuang/code-of-learn-deep-learning-with-pytorch
This is code of book "Learn Deep Learning with PyTorch"
Rongjiehuang/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.