KevinWang676

Speech Synthesis, Video Generation, Diffusion Models, and LLMs

Pinned Repositories

Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Language: Jupyter Notebook | 2.8k 33 100407
Bert-VITS2-quick-start
Auto slicing and labeling for Bert-VITS2, with one-click training
Language: Python | 7 1 01
ChatGLM2-Voice-Cloning
Chat immersively with any character you like: ChatGLM2 + SadTalker + voice cloning, with video conversation
Language: Python | 595 10 1592
ControlNet-with-GPT-4
Controllable Text-to-Image Generation with GPT-4
Language: Jupyter Notebook | 2 2 00
CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Language: Python | 7 0 07
GPT-SoVITS-emo
Language: Python | 44 3 03
Improved-SinDDM
Official PyTorch implementation of the paper: "SinDDM: A Single Image Denoising Diffusion Model"
Language: Python | 3 2 00
NeuCoSVC-2
Language: Jupyter Notebook | 15 1 18
Talking-Face-GPT
ChatGPT with video output
Language: Jupyter Notebook | 8 2 00
VITS2-Chinese
VITS2 for Chinese speech: the latest VITS2 Chinese speech synthesis
Language: Python | 130 5 315

KevinWang676's Repositories

KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Language: Jupyter Notebook | 2.8k 33 100407
KevinWang676/GPT-SoVITS-emo
Language: Python | 44 3 03
KevinWang676/NeuCoSVC-2
Language: Jupyter Notebook | 15 1 18
KevinWang676/CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Language: Python | 7 0 07
KevinWang676/NeuCoSVC-v2
Language: Python | 6 2 01
KevinWang676/GPT-SoVITS-v2
Language: Python | 5 3 02
KevinWang676/GPT-SoVITS-VC
GPT-SoVITS with voice conversion
Language: Python | 4 1 0
KevinWang676/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language: Python | 2 0 0
KevinWang676/OpenAI-TTS-for-srt
Language: Python | 2 3 0
KevinWang676/OpenVoice-for-srt
Language: Jupyter Notebook | 2 2 01
KevinWang676/SenseVoice-onnx
SenseVoice-python: An enterprise-grade, open-source, multi-language ASR system from FunASR, running on onnxruntime
Language: Python | 2 0 0
KevinWang676/DDPM-IP
Repo for our ICML 2023 paper "Input Perturbation Reduces Exposure Bias in Diffusion Models"
Language: Jupyter Notebook | 1 1 0
KevinWang676/emotion2vec
Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language: Python | 1 0 01
KevinWang676/fish-speech
Brand new TTS solution
Language: Python | 1 0 0
KevinWang676/infer_rvc_python
Language: Python | 1 0 0
KevinWang676/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
Language: Python | 1 0 0
KevinWang676/Open-GPT-4o
Language: Python | 1 2 0
KevinWang676/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language: Python | 1 0 0
KevinWang676/detail_tts
All generative models in one for a better TTS model
Language: Python | 0 0
KevinWang676/EmoChat
1 0
KevinWang676/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language: Python | 0 0
KevinWang676/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
Language: Python | 0 0
KevinWang676/KevinWang676
My profile
2 0
KevinWang676/KevinWang676.github.io
Personal website
Language: JavaScript | 1 0
KevinWang676/M4Singer
Language: Python | 0 0
KevinWang676/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
KevinWang676/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Language: Python | 1 0
KevinWang676/RWKV-Runner
An RWKV management and startup tool: fully automated and only 8MB. It also provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.
Language: TypeScript | 0 0
KevinWang676/TCSinger
PyTorch Implementation of TCSinger(EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control
KevinWang676/ttts
Train the next generation of TTS systems.
Language: Python | 0 0