ratzycon's Stars
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
AaronFeng753/Waifu2x-Extension-GUI
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
deep-floyd/IF
rhasspy/piper
A fast, local neural text to speech system
vikhyat/moondream
tiny vision language model
LykosAI/StabilityMatrix
Multi-Platform Package Manager for Stable Diffusion
arcee-ai/mergekit
Tools for merging pretrained large language models.
espeak-ng/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Bionus/imgbrd-grabber
Very customizable imageboard/booru downloader with powerful filenaming features.
2Retr0/GodotOceanWaves
FFT-based ocean-wave rendering, implemented in Godot
gcui-art/suno-api
Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.
synesthesiam/opentts
Open Text to Speech Server
daswer123/xtts-webui
Webui for using XTTS and for finetuning it
thorstenMueller/Thorsten-Voice
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.
ttxskk/AiOS
[CVPR 2024] Official Code for "AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation
huchenlei/sd-webui-api-payload-display
Display the corresponding API payload after each generation on WebUI
Vali-98/XTTS-RVC-UI
A Gradio UI for XTTSv2 and RVC.
uimac/mmdbridge
MikuMikuDance Plugin for All Renderers
alloystorm/dvvr
A versatile character model viewer and motion player that supports a range of model and motion formats including PMX (MMD) & XNALara/XPS models, as well as VMD/BVH motion formats.
Vwing/daggerfall-unity-android
Open source recreation of Daggerfall in the Unity engine, ported to Android
EliseWindbloom/audio2vmd
Completely automatically convert audio to vmd lips data with numerous features, using easy 1-click installer. Allowing you to lipsync your mmd models to any song or speech.