Pinned Repositories
Algorithm
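Implementations of some commonly used algorithms (covering common data structures, machine learning, and algorithms frequently used in speech recognition)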
Attentions-in-Tacotron
Automatic-Prosody-Annotation
autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Awesome-KBQA
Paper list of KBQA
book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
chinese_text_normalization
Chinese text normalization for speech processing
ForwardTacotron
⏩ Generating speech in a single forward pass without any attention!
NeMo
Neural Modules: a toolkit for conversational AI
tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
MuyangDu's Repositories
MuyangDu/ForwardTacotron
⏩ Generating speech in a single forward pass without any attention!
MuyangDu/Algorithm
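Implementations of some commonly used algorithms (covering common data structures, machine learning, and algorithms frequently used in speech recognition)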
MuyangDu/Attentions-in-Tacotron
MuyangDu/Automatic-Prosody-Annotation
MuyangDu/Awesome-KBQA
Paper list of KBQA
MuyangDu/book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
MuyangDu/CLUE
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
MuyangDu/Efficient-Incremental-TTS-on-GPUs
MuyangDu/fastmoe
A fast MoE impl for PyTorch
MuyangDu/g2p
g2p: English Grapheme To Phoneme Conversion
MuyangDu/g2p_seq2seq_pytorch
Grapheme to phoneme model for PyTorch
MuyangDu/g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin (Bopomofo) or Pinyin (INTERSPEECH 2022)
MuyangDu/GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
MuyangDu/GPT2-chitchat
GPT2 for Chinese chitchat: a GPT2 model for Chinese casual conversation (implements DialoGPT's MMI)
MuyangDu/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
MuyangDu/HiFi-TTS-Duration-Extractor
MuyangDu/incremental-fastpitch
MuyangDu/InstantSpeech
MuyangDu/lightseq
LightSeq: A High Performance Library for Sequence Processing and Generation
MuyangDu/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
MuyangDu/PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
MuyangDu/pytorchltr
Learning to Rank in PyTorch
MuyangDu/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
MuyangDu/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
MuyangDu/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
MuyangDu/UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
MuyangDu/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
MuyangDu/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
MuyangDu/WaveRNN-Heuristic-Dynamic-Blending
MuyangDu/WenetSpeechSpeakerCluster