Mikerhinos's Stars
Mikubill/sd-webui-controlnet
WebUI extension for ControlNet
lllyasviel/Omost
Your image is almost there!
SoftFever/OrcaSlicer
G-code generator for 3D printers (Bambu, Prusa, Voron, VzBot, RatRig, Creality, etc.)
Stability-AI/StableCascade
Official Code for Stable Cascade
huggingface/parler-tts
Inference and training library for high-quality TTS models.
Fanghua-Yu/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
vocodedev/vocode-core
🤖 Build voice-based LLM agents. Modular + open source.
facebookresearch/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
TMElyralab/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
RenderKit/oidn
Intel® Open Image Denoise library
rsxdalv/tts-generation-webui
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
resemble-ai/resemble-enhance
AI powered speech denoising and enhancement
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
BennyKok/comfyui-deploy
An open source `vercel` like deployment platform for Comfy UI
JonathanFly/bark
🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model
yangxy/PASD
[ECCV2024] Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization
HumanAIGC/VividTalk
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
lucidrains/meshgpt-pytorch
Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch
williamyang1991/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
idiap/coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
segmind/segmoe
fal-ai/aura-sr
AuraSR: GAN-based Super-Resolution for real-world
TMElyralab/Comfyui-MusePose
pzc163/Comfyui-CatVTON
kijai/comfyui-svd-temporal-controlnet
chflame163/ComfyUI_WordCloud
A ComfyUI plugin for generating word cloud images
chenxwh/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
ZHO-ZHO-ZHO/ComfyUI-AnyText
Unofficial implementation of AnyText for ComfyUI(EXP)