startreker-shzy

Pinned Repositories

SenseVoice
Multilingual Voice Understanding Model
Language: Python 2.8k 37 121264
ICCV2023-MCNET
The official code of our ICCV2023 work: Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation
Language: Python 245 7 2321
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Python 0 0
annotated_deep_learning_paper_implementations
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language: Jupyter Notebook 0 0
Automatic-Prosody-Annotation
Language: Python 0 0
DawDreamer
Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, Warp Markers, and JUCE processors
Language: C++ 0 0
spear-tts-pytorch
An unofficial PyTorch implementation of SPEAR-TTS.
Language: Jupyter Notebook 0 0
UniAudio
The Open Source Code of UniAudio
Language: Python 0 0
vits_chinese
Best TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Also for voice clone!
Language: Python 0 0

startreker-shzy's Repositories

startreker-shzy/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Python 0 0
startreker-shzy/annotated_deep_learning_paper_implementations
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language: Jupyter Notebook 0 0
startreker-shzy/Automatic-Prosody-Annotation
Language: Python 0 0
startreker-shzy/DawDreamer
Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, Warp Markers, and JUCE processors
Language: C++ 0 0
startreker-shzy/spear-tts-pytorch
An unofficial PyTorch implementation of SPEAR-TTS.
Language: Jupyter Notebook 0 0
startreker-shzy/UniAudio
The Open Source Code of UniAudio
Language: Python 0 0
startreker-shzy/vits_chinese
Best TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Also for voice clone!
Language: Python 0 0