Mikerhinos

Mikerhinos's Stars

Mikubill/sd-webui-controlnet
WebUI extension for ControlNet
Language:Python17k 145 1.5k2k
lllyasviel/Omost
Your image is almost there!
Language:Python7.3k 45 79418
SoftFever/OrcaSlicer
G-code generator for 3D printers (Bambu, Prusa, Voron, VzBot, RatRig, Creality, etc.)
Language:C++7.1k 129 4.6k838
Stability-AI/StableCascade
Official Code for Stable Cascade
Language:Jupyter Notebook6.5k 61 123533
huggingface/parler-tts
Inference and training library for high-quality TTS models.
Language:Python4.5k 53 107458
Fanghua-Yu/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Language:Python4.3k 67 142378
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Language:Python3.3k 43 159346
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Language:Python2.9k 32 134262
vocodedev/vocode-core
🤖 Build voice-based LLM agents. Modular + open source.
Language:Python2.9k 47 172490
facebookresearch/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
Language:Python2.7k 32 62254
TMElyralab/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
Language:Python2.2k 42 65160
RenderKit/oidn
Intel® Open Image Denoise library
Language:C++1.8k 49 186164
rsxdalv/tts-generation-webui
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
Language:TypeScript1.8k 35 231190
resemble-ai/resemble-enhance
AI powered speech denoising and enhancement
Language:Python1.4k 18 47138
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Language:Python1.3k 61 228158
BennyKok/comfyui-deploy
An open source `vercel` like deployment platform for Comfy UI
Language:TypeScript1k 13 37134
JonathanFly/bark
🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model
Language:Jupyter Notebook994 33 9293
yangxy/PASD
[ECCV2024] Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization
Language:Python889 10 6961
HumanAIGC/VividTalk
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
774 83 1447
lucidrains/meshgpt-pytorch
Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch
Language:Python743 17 7259
williamyang1991/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Language:Jupyter Notebook729 12 4171
idiap/coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python504 11 3050
segmind/segmoe
Language:Python406 7 2424
fal-ai/aura-sr
AuraSR: GAN-based Super-Resolution for real-world
Language:Python401 17 831
TMElyralab/Comfyui-MusePose
Language:Python360 2 5438
pzc163/Comfyui-CatVTON
Language:Jupyter Notebook128 1 1415
kijai/comfyui-svd-temporal-controlnet
Language:Python88 1 03
chflame163/ComfyUI_WordCloud
A ComfyUI plugin for generating word cloud images
Language:JavaScript87 2 105
chenxwh/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Language:Python57 1 09
ZHO-ZHO-ZHO/ComfyUI-AnyText
Unofficial implementation of AnyText for ComfyUI（EXP）
Language:Python54 6 44