Pinned Repositories
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
annotated_deep_learning_paper_implementations
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
AnyText
APISR
APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)
audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
HierSpeechpp
The official implementation of HierSpeech++
OpenVoice
Instant voice cloning by MyShell
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
StyleTTS2
vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
positivewon's Repositories
positivewon doesn’t have any repositories yet.