Robinatp

Robinatp's Stars

CompVis/stable-diffusion
A latent text-to-image diffusion model
Language: Jupyter Notebook 68.4k 557 71410.2k
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language: Python 39.2k 446 3135.1k
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language: Python 12.2k 206 2.3k 2.5k
openvpi/DiffSinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Language: Python 2.7k 36 104287
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Language: Python 2.5k 42 107222
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
1.9k 170 469
csteinmetz1/ai-audio-startups
Community list of startups working with AI in audio and music technology
1.6k 69 5137
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Language: Python 1.3k 53 31101
PlayVoice/lora-svc
Singing voice conversion based on Whisper, with LoRA for singing voice cloning
Language: Python 628 24 6979
zhangyongmao/VISinger2
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
Language: Python 321 12 2342
keonlee9420/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Language: Python 320 10 2744
xunmengshe/OpenUtau
Language: C# 269 4 627
YatingMusic/ddsp-singing-vocoders
Official implementation of SawSing (ISMIR'22)
Language: Python 254 7 737
adelacvg/NS2VC
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech
Language: Python 232 19 3712
M4Singer/M4Singer
Language: Python 193 10 1516
ncsoft/avocodo
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
Language: Python 149 4 519
CODEJIN/NaturalSpeech2
Language: Jupyter Notebook 140 13 1215
yl4579/HiFTNet
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
Language: Python 136 10 1011
lesterphillip/SVCC23_FastSVC
Singing Voice Conversion Challenge 2023 Starter Kit: FastSVC Reimplementation
Language: Python 111 7 1110
keonlee9420/FastPitchFormant
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
Language: Python 72 2 314
yousa-ling-official-production/yousa-ling-diffsinger-v1
泠鸢yousa's DiffSinger model, v1
42 1 14
openvpi/DiffSingerMiniEngine
A minimum inference engine for DiffSinger
Language: Python 34 2 18
timedomain-tech/ACE_phonemes
A guide to grapheme-to-phoneme conversion and the phoneme list for the ACE singing voice synthesis engine
Language: Python 32 3 0 7
seyong92/phoneme-informed-note-level-singing-transcription
A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023
Language: Python 24 5 22
fishaudio/OpenUtau
OpenUTAU renderer for diffsinger. Usage instructions: https://github.com/xunmengshe/OpenUtau/wiki/%E4%BD%BF%E7%94%A8%E6%96%B9%E6%B3%95%EF%BC%88%E4%B8%AD%E6%96%87%EF%BC%89
Language: C# 23 1 0 1
chomeyama/UnifiedSourceFilterGAN
Language: Python 19 2 0 5
timedomain-tech/ACE_sequence_file
Open-source file format designed for high-quality, customizable singing synthesis.
Language: Python 11 5 0 5
A-Quarter-Mile/PHONEix
PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution Predictor
5 1 0 1
MaxMax2016/EasyVC
A comprehensive comparison of voice conversion techniques
Language: Python 1 0 0 2
Robinatp/SECaps
Language: Python 1 0 0