npujcong's Stars
maum-ai/assem-vc
Official Code for Assem-VC @ICASSP2022
MLNLP-World/Top-AI-Conferences-Paper-with-Code
MLNLP: This repository is a collection of AI top conferences papers (e.g. ACL, EMNLP, NAACL, COLING, AAAI, IJCAI, ICLR, NeurIPS, and ICML) with open resource code
rishikksh20/iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
WelkinYang/Learn2Sing2.0
Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
speechio/chinese_text_normalization
Chinese text normalization for speech processing
mli/paper-reading
深度学习经典、新论文逐段精读
snakers4/silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
maum-ai/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
sp-nitech/diffsptk
A differentiable version of SPTK
facebookresearch/textlesslib
Library for Textless Spoken Language Processing
synesthesiam/opentts
Open Text to Speech Server
sony/ai-research-code
microsoft/NeuralSpeech
Rongjiehuang/Multi-Singer
PyTorch Implementation of Multi-Singer (ACM-MM'21)
Anduin2017/HowToCook
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
wptoux/albert-chinese-large-webqa
基于百度webqa与dureader数据集训练的Albert Large QA模型
NATSpeech/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
SungFeng-Huang/Meta-TTS
Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.
facebookresearch/ConvNeXt
Code release for ConvNeXt model
resemble-ai/Resemblyzer
A python package to analyze and compare voices with deep learning
google/visqol
Perceptual Quality Estimator for speech and audio
KentoNishi/torch-pitch-shift
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
MoonInTheRiver/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
wenet-e2e/opencpop
Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis
keonlee9420/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
facebookresearch/vocoder-benchmark
A repository for benchmarking neural vocoders by their quality and speed.
maxrmorrison/torchcrepe
Pytorch implementation of the CREPE pitch tracker
k2kobayashi/crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
slhck/ffmpeg-normalize
Audio Normalization for Python/ffmpeg
facebookresearch/speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.