npujcong

npujcong@gmail.com

bytedanceChina

npujcong's Stars

suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook36.5k 330 4474.3k
chenfei-wu/TaskMatrix
Language:Python34.6k 301 3553.3k
openai/consistency_models
Official repo for consistency models.
Language:Python6.2k 59 53425
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language:Python3.5k 58 71309
openai/improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
Language:Python3.4k 120 137495
serp-ai/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
Language:Jupyter Notebook3.2k 49 81429
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
Language:Python3k 87 98419
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
Language:Python2k 40 43168
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
1.9k 168 470
DigitalPhonetics/IMS-Toucan
Controllable and fast Text-to-Speech for over 7000 languages!
Language:Python1.5k 23 168170
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
Language:Python1.5k 29 93148
Harmonai-org/sample-generator
Tools to train a generative model on arbitrary audio samples
Language:Jupyter Notebook1.1k 46 12173
TencentGameMate/chinese_speech_pretrain
chinese speech pretrained models
Language:Shell1.1k 10 5688
lucidrains/voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
Language:Python623 48 2553
ZhangXInFD/SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
Language:Python507 16 2245
modelscope/KAN-TTS
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
Language:Python498 13 7184
guan-yuan/Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion
A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR...etc).
414 13 529
yl4579/StyleTTS
Official Implementation of StyleTTS
Language:Python407 32 7566
pystiche/pystiche
Framework for Neural Style Transfer (NST) built upon PyTorch
Language:Python271 9 6728
interactiveaudiolab/penn
Pitch Estimating Neural Networks (PENN)
Language:Python238 10 1223
152334H/DL-Art-School
TorToiSe fine-tuning with DLAS
Language:Python218 15 65117
ddlBoJack/Awesome-Speech-Pretraining
Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.
203 13 014
Zain-Jiang/Speech-Editing-Toolkit
It's a repository for implementations of neural speech editing algorithms.
Language:Python192 9 2419
yl4579/StyleTTS-VC
Official Implementation of StyleTTS-VC
Language:Python164 17 923
hhguo/MSMC-TTS
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
Language:Python162 15 915
xinjli/transphone
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
Language:Python150 13 1115
neonbjb/tts-scores
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
Language:Python145 5 1415
tts-tutorial/book
63 22 11
xrenaa/Retriever
[ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"
54 17 22
ictnlp/GMA
Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"
Language:Python11 1 74

npujcong

npujcong's Stars

suno-ai/bark

chenfei-wu/TaskMatrix

openai/consistency_models

facebookresearch/encodec

openai/improved-diffusion

serp-ai/bark-with-voice-clone

enhuiz/vall-e

archinetai/audio-diffusion-pytorch

archinetai/audio-ai-timeline

DigitalPhonetics/IMS-Toucan

LAION-AI/CLAP

Harmonai-org/sample-generator

TencentGameMate/chinese_speech_pretrain

lucidrains/voicebox-pytorch

ZhangXInFD/SpeechTokenizer

modelscope/KAN-TTS

guan-yuan/Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion

yl4579/StyleTTS

pystiche/pystiche

interactiveaudiolab/penn

152334H/DL-Art-School

ddlBoJack/Awesome-Speech-Pretraining

Zain-Jiang/Speech-Editing-Toolkit

yl4579/StyleTTS-VC

hhguo/MSMC-TTS

xinjli/transphone

neonbjb/tts-scores

tts-tutorial/book

xrenaa/Retriever

ictnlp/GMA