cywang97
Hi, I am a fourth-year joint PhD student at Nankai University and Microsoft Research Asia, working on end-to-end ASR and speech translation.
Nankai University
Beijing, China
cywang97's Stars
labuladong/fucking-algorithm
Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.
openai/openai-cookbook
Examples and guides for using the OpenAI API
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
ivy-llc/ivy
Convert Machine Learning Code Between Frameworks
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
openai/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
facebookresearch/metaseq
Repo for external large-scale work
microsoft/DeepSpeedExamples
Example models using DeepSpeed
SuperCV/Book
:green_book: My personal book study notes and collection
ivan-bilan/The-NLP-Pandect
A comprehensive reference for all topics related to Natural Language Processing
LAION-AI/audio-dataset
Audio Dataset for training CLAP and other models
hirofumi0810/neural_sp
End-to-end ASR/LM implementation with PyTorch
karpathy/deep-vector-quantization
VQVAEs, GumbelSoftmaxes and friends
microsoft/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
facebookresearch/speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
facebookresearch/CPC_audio
An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
yangdongchao/Text-to-sound-Synthesis
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
bshall/ZeroSpeech
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
cywang97/StreamingTransformer
swasun/VQ-VAE-Speech
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
cientgu/GIQA
PyTorch implementation of Generated Image Quality Assessment
cientgu/Mask_Guided_Portrait_Editing
PyTorch implementation of "Mask-Guided Portrait Editing with Conditional GANs"
bshall/VectorQuantizedCPC
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
MingjieChen/wavenet_autoencoders
WaveNet auto-encoders for ZeroSpeech challenge 2020