ratzycon

ratzycon's Stars

coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python36.5k 297 1.1k4.5k
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language:Python30.3k 220 2573k
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
14.5k 673 94979
AaronFeng753/Waifu2x-Extension-GUI
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.
Language:C++13.4k 147 436894
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Language:Python11.1k 173 6722.3k
deep-floyd/IF
Language:Python7.7k 84 101506
rhasspy/piper
A fast, local neural text to speech system
Language:C++7.2k 78 500530
vikhyat/moondream
tiny vision language model
Language:Jupyter Notebook6.2k 60 135510
LykosAI/StabilityMatrix
Multi-Platform Package Manager for Stable Diffusion
Language:C#5.1k 73 751328
arcee-ai/mergekit
Tools for merging pretrained large language models.
Language:Python5k 52 328462
espeak-ng/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Language:C4.4k 101 1k922
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language:Jupyter Notebook3.7k 42 185319
Bionus/imgbrd-grabber
Very customizable imageboard/booru downloader with powerful filenaming features.
Language:HTML2.6k 100 3.1k221
2Retr0/GodotOceanWaves
FFT-based ocean-wave rendering, implemented in Godot
Language:C#2.2k 12 784
gcui-art/suno-api
Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.
Language:TypeScript1.6k 38 165376
synesthesiam/opentts
Open Text to Speech Server
Language:Python980 17 45140
daswer123/xtts-webui
Webui for using XTTS and for finetuning it
Language:Python693 21 107133
thorstenMueller/Thorsten-Voice
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.
Language:Python563 21 6352
ttxskk/AiOS
[CVPR 2024] Official Code for "AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation
Language:Python281 22 247
huchenlei/sd-webui-api-payload-display
Display the corresponding API payload after each generation on WebUI
Language:JavaScript196 3 2419
Vali-98/XTTS-RVC-UI
A Gradio UI for XTTSv2 and RVC.
Language:Python151 3 2254
uimac/mmdbridge
MikuMikuDance Plugin for All Renderers
Language:C++126 9 1131
alloystorm/dvvr
A versatile character model viewer and motion player that supports a range of model and motion formats including PMX (MMD) & XNALara/XPS models, as well as VMD/BVH motion formats.
Language:Python79 7 3832
Vwing/daggerfall-unity-android
Open source recreation of Daggerfall in the Unity engine, ported to Android
Language:C#59 8 42
EliseWindbloom/audio2vmd
Completely automatically convert audio to vmd lips data with numerous features, using easy 1-click installer. Allowing you to lipsync your mmd models to any song or speech.
Language:Python12 2 31