yuan1615

TTS

CVTEGuangZhou

yuan1615's Stars

mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language:Jupyter Notebook37.8k 396 674k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python33.5k 204 1.2k3.8k
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language:Python29k 213 2422.8k
xiaolai/everyone-can-use-english
人人都能用英语
Language:TypeScript20.8k 267 2453.4k
bleedline/aimoneyhunter
ai副业赚钱大集合，教你如何利用ai做一些副业项目，赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English version for more insights.
13.1k 162 201.2k
fishaudio/fish-speech
Brand new TTS solution
Language:Python12.9k 91 366962
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
Language:Python10.7k 125 217785
JoeanAmier/TikTokDownloader
TikTok 主页/合辑/直播/视频/图集/原声；抖音主页/视频/图集/收藏/直播/原声/合集/评论/账号/搜索/热榜数据采集工具
Language:Python7.6k 55 2711.2k
Vaibhavs10/insanely-fast-whisper
Language:Jupyter Notebook7.5k 65 189529
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Language:Python5.6k 63 98508
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language:Python4.8k 78 192393
ai-forever/Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
Language:Jupyter Notebook2.7k 48 87307
facebookresearch/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
Language:Python2.7k 30 59252
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
Language:Python1.6k 21 8685
csteinmetz1/ai-audio-startups
Community list of startups working with AI in audio and music technology
1.5k 68 5134
resemble-ai/resemble-enhance
AI powered speech denoising and enhancement
Language:Python1.3k 18 46135
willisma/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
Language:Python600 9 2228
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python589 15 4142
daniilrobnikov/vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
Language:Jupyter Notebook476 12 1549
shansongliu/M2UGen
This is the official repository for M2UGen
Language:Jupyter Notebook441 10 1138
haoheliu/voicefixer_main
General Speech Restoration
Language:Python274 11 1854
hayeong0/Diff-HierVC
Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"
Language:Python189 15 718
zhenye234/CoMoSpeech
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
Language:Python177 12 1118
X-LANCE/StoryTTS
[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations
Language:HTML132 17 24
Grace9994/CoMoSVC
CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone
Language:Python126 3 1218
thu-ml/Bridge-TTS
Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).
120 40 41
DavidMChan/Anim400K
Anim-400K: A dataset designed from the ground up for automated dubbing of video
97 7 01
0417keito/JEN-1-pytorch
Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.04729)
Language:Python51 2 88
neonbjb/pyfastmp3decoder
A fast MP3 decoder for python, using minimp3
Language:Cython26 3 28
google/df-conformer
Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.
Language:HTML20 2 14

yuan1615

yuan1615's Stars

mlabonne/llm-course

RVC-Boss/GPT-SoVITS

myshell-ai/OpenVoice

xiaolai/everyone-can-use-english

bleedline/aimoneyhunter

fishaudio/fish-speech

InstantID/InstantID

JoeanAmier/TikTokDownloader

Vaibhavs10/insanely-fast-whisper

pytorch-labs/gpt-fast

yl4579/StyleTTS2

ai-forever/Kandinsky-2

facebookresearch/audio2photoreal

baaivision/Emu

csteinmetz1/ai-audio-startups

resemble-ai/resemble-enhance

willisma/SiT

ddlBoJack/emotion2vec

daniilrobnikov/vits2

shansongliu/M2UGen

haoheliu/voicefixer_main

hayeong0/Diff-HierVC

zhenye234/CoMoSpeech

X-LANCE/StoryTTS

Grace9994/CoMoSVC

thu-ml/Bridge-TTS

DavidMChan/Anim400K

0417keito/JEN-1-pytorch

neonbjb/pyfastmp3decoder

google/df-conformer