timegate's Stars
xiefan-guo/initno
[CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
leaningtech/webvm
Virtual Machine for the Web
TTPlanetPig/Comfyui_TTP_Toolset
for tile the image for advanced control or modification
TTPlanetPig/Comfyui_Object_Migration
This is a study aim to transfer the single concept by using DIT model self-attention capablity
NJU-PCALab/RAG-Diffusion
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
magic-quill/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
nekhtiari/image-similarity-measures
:chart_with_upwards_trend: Implementation of eight evaluation metrics to access the similarity between two images. The eight metrics are as follows: RMSE, PSNR, SSIM, ISSM, FSIM, SRE, SAM, and UIQ.
aim-uofa/StyleDrop-PyTorch
This is an unofficial PyTorch implementation of StyleDrop: Text-to-Image Generation in Any Style.
ingra14m/Specular-Gaussians
[NeurIPS 2024] Official implementation of "Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian Splatting"
ShmuelRonen/ComfyUI-FreeMemory
ComfyUI-FreeMemory is a custom node extension for ComfyUI that provides memory management capabilities within your image generation workflows.
naver-ai/ZIM
ZIM: Zero-Shot Image Matting for Anything
xinge008/Cylinder3D
Rank 1st in the leaderboard of SemanticKITTI semantic segmentation (both single-scan and multi-scan) (Nov. 2020) (CVPR2021 Oral)
MS-Diffusion/MS-Diffusion
Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
hyz317/StdGEN
hanweikung/face_anon_simple
[WACV 2025] Official implementation of "Face Anonymization Made Simple"
dockur/macos
OSX (macOS) inside a Docker container.
mikkel/ComfyUI-text-overlay
Overlay text on an image in ComfyUI with font/alignment/placement customization
DiffPoseTalk/DiffPoseTalk
DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models
ntegrals/aura-voice
Aura is like Siri, but in your browser. An AI voice assistant optimized for low latency responses.
neu-vi/SMooDi
abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation(UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
PKU-VCL-Geometry/GeoSplatting
jags111/efficiency-nodes-comfyui
A collection of ComfyUI custom nodes.- Awesome smart way to work with nodes!
VrchStudio/comfyui-web-viewer
ComfyUI custom nodes and web utilities for real-time AI generation and interaction
Tencent/Hunyuan3D-1
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
erosDiffusion/ComfyUI-enricos-nodes
Compositor Node experiments
xdit-project/mochi-xdit
faster parallel inference of mochi-1 video generation model
xdit-project/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Kmcode1/SG-I2V
This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.
VideoVerses/VideoTuna
Let's finetune video generation models!