npujcong

npujcong@gmail.com

bytedance, China

npujcong's Stars

maum-ai/assem-vc
Official Code for Assem-VC @ICASSP2022
Language:Jupyter Notebook26538
MLNLP-World/Top-AI-Conferences-Paper-with-Code
MLNLP: This repository is a collection of AI top conferences papers (e.g. ACL, EMNLP, NAACL, COLING, AAAI, IJCAI, ICLR, NeurIPS, and ICML) with open resource code
2.6k stars · 603 forks
rishikksh20/iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
Language:Python23047
WelkinYang/Learn2Sing2.0
Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
Language:JavaScript17826
speechio/chinese_text_normalization
Chinese text normalization for speech processing
Language:Python639147
mli/paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
27.6k stars · 2.5k forks
snakers4/silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Language: Jupyter Notebook · 5k stars · 321 forks
maum-ai/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
Language:Python26746
sp-nitech/diffsptk
A differentiable version of SPTK
Language:Python16914
facebookresearch/textlesslib
Library for Textless Spoken Language Processing
Language:Python53051
synesthesiam/opentts
Open Text to Speech Server
Language:Python976138
sony/ai-research-code
Language:Python34966
microsoft/NeuralSpeech
Language: Python · 1.4k stars · 181 forks
Rongjiehuang/Multi-Singer
PyTorch Implementation of Multi-Singer (ACM-MM'21)
Language:Python13921
Anduin2017/HowToCook
Programmer's guide about how to cook at home (Simplified Chinese only).
Language: Dockerfile · 68.3k stars · 8.8k forks
wptoux/albert-chinese-large-webqa
ALBERT Large QA model trained on the Baidu WebQA and DuReader datasets
Language:Jupyter Notebook7615
NATSpeech/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Language:Python971100
SungFeng-Huang/Meta-TTS
Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.
Language:Python18836
facebookresearch/ConvNeXt
Code release for ConvNeXt model
Language: Python · 5.8k stars · 701 forks
resemble-ai/Resemblyzer
A python package to analyze and compare voices with deep learning
Language: Python · 2.8k stars · 432 forks
google/visqol
Perceptual Quality Estimator for speech and audio
Language:C++719127
KentoNishi/torch-pitch-shift
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
Language:Python13412
MoonInTheRiver/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Language: Python · 4.4k stars · 717 forks
wenet-e2e/opencpop
Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis
21110
keonlee9420/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
Language:Python32441
facebookresearch/vocoder-benchmark
A repository for benchmarking neural vocoders by their quality and speed.
Language:Python20727
maxrmorrison/torchcrepe
Pytorch implementation of the CREPE pitch tracker
Language:Python41563
k2kobayashi/crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
Language:Python16931
slhck/ffmpeg-normalize
Audio Normalization for Python/ffmpeg
Language: Python · 1.3k stars · 118 forks
facebookresearch/speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
Language:Python39456
