Pinned Repositories
Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Bert-VITS2-quick-start
Automatic audio slicing, labeling, and one-click training for Bert-VITS2
ChatGLM2-Voice-Cloning
Immersive chat with any character you like: ChatGLM2 + SadTalker + Voice Cloning, with video conversation
ControlNet-with-GPT-4
Controllable Text-to-Image Generation with GPT-4
CosyVoice
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
GPT-SoVITS-emo
Improved-SinDDM
Official PyTorch implementation of the paper: "SinDDM: A Single Image Denoising Diffusion Model"
NeuCoSVC-2
Talking-Face-GPT
ChatGPT with video output
VITS2-Chinese
VITS2 for Chinese speech: the latest VITS2 Chinese speech synthesis
KevinWang676's Repositories
KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
KevinWang676/GPT-SoVITS-emo
KevinWang676/NeuCoSVC-2
KevinWang676/CosyVoice
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
KevinWang676/NeuCoSVC-v2
KevinWang676/GPT-SoVITS-v2
KevinWang676/GPT-SoVITS-VC
GPT-SoVITS with voice conversion
KevinWang676/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
KevinWang676/OpenAI-TTS-for-srt
KevinWang676/OpenVoice-for-srt
KevinWang676/SenseVoice-onnx
SenseVoice-python: An enterprise-grade, open-source, multilingual ASR system from FunASR, running on ONNX Runtime
KevinWang676/DDPM-IP
Repo for our ICML 2023 paper "Input Perturbation Reduces Exposure Bias in Diffusion Models"
KevinWang676/emotion2vec
Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
KevinWang676/fish-speech
Brand new TTS solution
KevinWang676/infer_rvc_python
KevinWang676/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
KevinWang676/Open-GPT-4o
KevinWang676/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
KevinWang676/detail_tts
All generative models in one for a better TTS model
KevinWang676/EmoChat
KevinWang676/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
KevinWang676/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
KevinWang676/KevinWang676
My profile
KevinWang676/KevinWang676.github.io
Personal website
KevinWang676/M4Singer
KevinWang676/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
KevinWang676/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
KevinWang676/RWKV-Runner
An RWKV management and startup tool, fully automated and only 8 MB, that also provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.
KevinWang676/TCSinger
PyTorch implementation of TCSinger (EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control
KevinWang676/ttts
Train the next generation of TTS systems.