zhaoyun630
I obtained my PhD degree in physics in Dec 2016. My current research focuses on NLP and ASR.
CloudWalkShanghai, China
Pinned Repositories
2d_phasing_function_lib
It contains many Matlab functions and algorithm implementations for processing 2D crystal dataset in x-ray diffraction experiment.
Algorithm_for_Interview-Chinese
Algorithm for Interview(面试算法笔记-中文)
Artceleration-EEE598-Assn2
asr_dataset
The dataset of Speech Recognition
auraloss
Collection of audio-focused loss functions in PyTorch
autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
chinese_text_normalization
Chinese text normalization for speech processing
code_snippets
esim-response-selection
ESIM for Multi-turn Response Selection Task
hybrid-multi-spk-vc
a hybrid multi-speaker voice conversion system
zhaoyun630's Repositories
zhaoyun630/chinese_text_normalization
Chinese text normalization for speech processing
zhaoyun630/hybrid-multi-spk-vc
a hybrid multi-speaker voice conversion system
zhaoyun630/asr_dataset
The dataset of Speech Recognition
zhaoyun630/auraloss
Collection of audio-focused loss functions in PyTorch
zhaoyun630/autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
zhaoyun630/awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
zhaoyun630/BaiduSpider
BaiduSpider,一个爬取百度搜索结果的爬虫,目前支持百度网页搜索,百度图片搜索,百度知道搜索,百度视频搜索,百度资讯搜索,百度文库搜索,百度经验搜索和百度百科搜索。
zhaoyun630/CDial-GPT
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
zhaoyun630/COVID-Dialogue
zhaoyun630/DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
zhaoyun630/Emotional-Speech-Data
This is the GitHub page for publicly available emotional speech data.
zhaoyun630/espnet
End-to-End Speech Processing Toolkit
zhaoyun630/FastVocoder
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
zhaoyun630/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
zhaoyun630/ICASSP2021_paper_list-VC
ICASSP 2021 accepted papers in term of voice conversion (VC)
zhaoyun630/kaldi-onnx
Kaldi model converter to ONNX
zhaoyun630/libtorch_tokenizer
BERT Tokenizer in C++
zhaoyun630/Mengzi
Mengzi Pretrained Models
zhaoyun630/MetaDialog
Platform for few-shot natural language processing: Text Classification, Sequene Labeling.
zhaoyun630/multi-speaker-tacotron
VCTK multi-speaker tacotron for ICASSP 2020
zhaoyun630/nnet_pytorch
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
zhaoyun630/pointer_summarizer
pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"
zhaoyun630/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
zhaoyun630/StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
zhaoyun630/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
zhaoyun630/unilib
Embeddable C++17 Unicode library offering UTF encodings, general category info, simple and full casing, normalization forms, and combining marks stripping.
zhaoyun630/WaveRNN
WaveRNN Vocoder + TTS
zhaoyun630/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
zhaoyun630/x-vector-pytorch
Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
zhaoyun630/zhaoyun630.github.io