timegate's Stars

Shenyi-Z/ToCa
Accelerating Diffusion Transformers with Token-wise Feature Caching
Language:Python241
mingyuanzhou/SiD
PyTorch code and model checkpoints for Score identity Distillation (SiD) published in ICML 2024
Language:Python735
Lightricks/LTX-Video
Official repository for LTX-Video
Language:Python38018
camenduru/echomimic-jupyter
Language:Jupyter Notebook7
eldar/flash3d
Official implementation of Flash3D paper
Language:Python13712
hmrishavbandy/FlipSketch
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations
Language:Python536
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Language:Python40636
mingyuanzhou/SiD-LSG
Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation
Language:Python362
votchallenge/toolkit
The official VOT Challenge evaluation and analysis toolkit
Language:Python16446
yangchris11/samurai
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Language:Python1.6k104
CiaraStrawberry/stylecodes
Language:Python29
DSaurus/Human4DiT
This repository is the official implementation of Human4DiT: 360-degree Human Video Generation with 4D Diffusion Transformer.
Language:Python45
USTC3DV/NeRFBlendShape-code
Language:Python21418
jdh-algo/JoyVASA
Language:Python31324
a-lgil/pose-depot
A collection of ControlNet poses
Language:Astro1116
zombieyang/sd-ppp
Communicate between Photoshop and SD/SDForge/ComfyUI
Language:Python2719
akatz-ai/ComfyUI-X-Portrait-Nodes
Wrapper for X-Portrait for running in ComfyUI
Language:Python68
PKU-YuanGroup/LLaVA-CoT
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
1.3k41
NIRVANALAN/GaussianAnything
High-quality and editable surfel Gaussian generation through native 3D diffusion.
1322
DrewThomasson/ebook2audiobook
Generates an audiobook with chapters and ebook metadata using Calibre and XTTS from Coqui TTS, with optional voice cloning and support for multiple languages
Language:Python1k98
NVIDIA/garak
the LLM vulnerability scanner
Language:Python2.7k234
NVlabs/addit
1732
xiefan-guo/initno
[CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
Language:Python311
leaningtech/webvm
Virtual Machine for the Web
Language:Svelte10.2k1.5k
TTPlanetPig/Comfyui_TTP_Toolset
Tiles the image for advanced control or modification
Language:Python3279
TTPlanetPig/Comfyui_Object_Migration
A study that aims to transfer a single concept using the DiT model's self-attention capability
Language:Python49117
NJU-PCALab/RAG-Diffusion
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
Language:Python38014
magic-quill/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Language:Python1.5k115
nekhtiari/image-similarity-measures
Implementation of eight evaluation metrics to assess the similarity between two images. The eight metrics are as follows: RMSE, PSNR, SSIM, ISSM, FSIM, SRE, SAM, and UIQ.
Language:Python58868
aim-uofa/StyleDrop-PyTorch
This is an unofficial PyTorch implementation of StyleDrop: Text-to-Image Generation in Any Style.
Language:Python20313