timegate's Stars
Shenyi-Z/ToCa
Accelerating Diffusion Transformers with Token-wise Feature Caching
mingyuanzhou/SiD
PyTorch code and model checkpoints for Score identity Distillation (SiD) published in ICML 2024
Lightricks/LTX-Video
Official repository for LTX-Video
camenduru/echomimic-jupyter
eldar/flash3d
Official implementation of Flash3D paper
hmrishavbandy/FlipSketch
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
mingyuanzhou/SiD-LSG
Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation
votchallenge/toolkit
The official VOT Challenge evaluation and analysis toolkit
yangchris11/samurai
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
CiaraStrawberry/stylecodes
DSaurus/Human4DiT
This repository is the official implementation of Human4DiT: 360-degree Human Video Generation with 4D Diffusion Transformer.
USTC3DV/NeRFBlendShape-code
jdh-algo/JoyVASA
a-lgil/pose-depot
A collection of ControlNet poses
zombieyang/sd-ppp
Communicate between Photoshop and SD/SDForge/ComfyUI
akatz-ai/ComfyUI-X-Portrait-Nodes
Wrapper for X-Portrait for running in ComfyUI
PKU-YuanGroup/LLaVA-CoT
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
NIRVANALAN/GaussianAnything
High-quality and editable surfel Gaussian generation through native 3D diffusion.
DrewThomasson/ebook2audiobook
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, with optional voice cloning and support for multiple languages
NVIDIA/garak
the LLM vulnerability scanner
NVlabs/addit
xiefan-guo/initno
[CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
leaningtech/webvm
Virtual Machine for the Web
TTPlanetPig/Comfyui_TTP_Toolset
Tiles the image for advanced control or modification
TTPlanetPig/Comfyui_Object_Migration
This is a study that aims to transfer a single concept by using the DIT model's self-attention capability
NJU-PCALab/RAG-Diffusion
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
magic-quill/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
nekhtiari/image-similarity-measures
Implementation of eight evaluation metrics to assess the similarity between two images. The eight metrics are as follows: RMSE, PSNR, SSIM, ISSM, FSIM, SRE, SAM, and UIQ.
aim-uofa/StyleDrop-PyTorch
This is an unofficial PyTorch implementation of StyleDrop: Text-to-Image Generation in Any Style.