Pinned Repositories
Applio
VITS-based Voice Conversion focused on simplicity, quality and performance.
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
kara-audio
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.
Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
rtx-upscaler
A Gradio-based GUI for enhancing old/low-quality videos with NVIDIA RTX technology. Using Maxine Video Effects SDK, this tool applies AI-powered Super Resolution and Artifact Reduction. Perfect for restoring vintage videos and enhancing low-resolution footage with real-time GPU acceleration.
voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
abus-aikorea's Repositories
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
abus-aikorea/kara-audio
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.
abus-aikorea/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
abus-aikorea/rtx-upscaler
A Gradio-based GUI for enhancing old/low-quality videos with NVIDIA RTX technology. Using Maxine Video Effects SDK, this tool applies AI-powered Super Resolution and Artifact Reduction. Perfect for restoring vintage videos and enhancing low-resolution footage with real-time GPU acceleration.
abus-aikorea/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
abus-aikorea/Applio
VITS-based Voice Conversion focused on simplicity, quality and performance.
abus-aikorea/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"