ygyuan

Northwestern Polytechnical University, Xi'an

ygyuan's Stars

myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language:Python30.2k 217 2543k
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language:Python26.2k 179 1304.9k
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
Language:Python25.5k 180 1.7k3.7k
jiaaro/pydub
Manipulate audio with a simple and easy high level interface
Language:Python9k 136 5851.1k
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python8.8k 84 639848
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python8.3k 79 4441.1k
kyutai-labs/moshi
Language:Python7k 80 91550
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.6k 73 1k799
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Language:Python4.9k 44 488469
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language:Python3.8k 41 160341
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language:Python3.7k 66 104304
OpenNMT/CTranslate2
Fast inference engine for Transformer models
Language:C++3.5k 60 719310
ymcui/Chinese-ELECTRA
Pre-trained Chinese ELECTRA models
Language:Python1.4k 26 86171
innnky/emotional-vits
An emotion-controllable speech synthesis model that requires no emotion labels, based on VITS
Language:Jupyter Notebook1.3k 12 34167
jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language:Python924 22 5753
NVIDIA/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
Language:Python923 70 0111
lenML/Speech-AI-Forge
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
Language:Python921 14 154122
Plachtaa/seed-vc
zero-shot voice conversion & singing voice conversion, with real-time support
Language:Python813 28 8499
lifeiteng/OmniSenseVoice
Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯
Language:Python770 9 1030
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
Language:Python505 31 2336
facebookresearch/speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
Language:Python394 19 2056
liutaocode/TTS-arxiv-daily
Automatically update text-to-speech (TTS) papers daily using GitHub Actions (updated every 12 hours)
Language:Python322 43 022
thuhcsi/NeuCoSVC
Language:Python263 6 940
hayeong0/DDDM-VC
Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)
Language:Python204 16 1921
xingchensong/S3Tokenizer
Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice
Language:Python189 7 421
huangxu1991/GPT-SoVITS-VC
VC Without Retrain!
Language:Python107 4 06
hhguo/SoCodec
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
Language:Python75 7 74
LetterLiGo/SafeEar
SafeEar: Content Privacy-Preserving Audio Deepfake Detection (Accepted by CCS 2024)
Language:Python64 1 159
smileslab/Comparative-Analysis-Voice-Spoofing
A comparative analysis of voice spoofing detection systems, based on a paper available at https://arxiv.org/abs/2210.00417.
Language:MATLAB14 1 12
ppmzhang2/seed-vc
zero-shot voice conversion with in context learning
Language:Python2 0 00
