Pinned Repositories
SenseVoice
Multilingual Voice Understanding Model
ICCV2023-MCNET
The official code of our ICCV2023 work: Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head Video Generation
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
annotated_deep_learning_paper_implementations
🧑‍🏫 60 implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Automatic-Prosody-Annotation
DawDreamer
Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, Warp Markers, and JUCE processors
spear-tts-pytorch
An unofficial PyTorch implementation of SPEAR-TTS.
UniAudio
The Open Source Code of UniAudio
vits_chinese
TTS based on BERT and VITS, with some Natural Speech features from Microsoft; also supports voice cloning.
startreker-shzy's Repositories
startreker-shzy/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
startreker-shzy/annotated_deep_learning_paper_implementations
🧑‍🏫 60 implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
startreker-shzy/Automatic-Prosody-Annotation
startreker-shzy/DawDreamer
Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, Warp Markers, and JUCE processors
startreker-shzy/spear-tts-pytorch
An unofficial PyTorch implementation of SPEAR-TTS.
startreker-shzy/UniAudio
The Open Source Code of UniAudio
startreker-shzy/vits_chinese
TTS based on BERT and VITS, with some Natural Speech features from Microsoft; also supports voice cloning.