zhw-zhang's Stars
sczhou/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
rese1f/StableVideo
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
qiuyu96/CoDeF
[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
JingyunLiang/VRT
VRT: A Video Restoration Transformer (official repository)
mit-han-lab/fastcomposer
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
alibaba/EasyNLP
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
PeterWang512/GenDataAttribution
Evaluating Data Attribution for Text-to-Image Models: a visual data attribution benchmark for evaluating and learning training image influences.
vvictoryuki/FreeDoM
[ICCV 2023] Official PyTorch implementation for the paper "FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model"
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
hila-chefer/Transformer-Explainability
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
omerbt/TokenFlow
Official PyTorch implementation of "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" (ICLR 2024)
zsyOAOA/ResShift
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS 2023 Spotlight)
ali-vilab/AnyDoor
Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"
ssundaram21/dreamsim
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight)
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
guoyww/AnimateDiff
Official implementation of AnimateDiff.
text2cinemagraph/text2cinemagraph
Text2Cinemagraph: Text-Guided Synthesis of Eulerian Cinemagraphs [SIGGRAPH ASIA 2023]
ytdl-org/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
kabachuha/sd-webui-text2video
Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies
ShuyUSTC/VidGenMetrics
Quantitative evaluation metrics for video generation
wilson1yan/VideoGPT
pfnet-research/tgan2
The official implementation of "Train Sparsely, Generate Densely: Memory-efficient Unsupervised Training of High-resolution Temporal GAN"
voletiv/mcvd-pytorch
Official implementation of MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation (https://arxiv.org/abs/2205.09853)
MKFMIKU/vidm
[AAAI23 Oral] Official implementations of Video Implicit Diffusion Models
universome/stylegan-v
[CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
NVlabs/long-video-gan
Official PyTorch implementation of LongVideoGAN
Picsart-AI-Research/VideoINR-Continuous-Space-Time-Super-Resolution
[CVPR 2022] VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution
AaronFeng753/Waifu2x-Extension-GUI
Video, image, and GIF upscaling/enlargement (super-resolution) and video frame interpolation, achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution (VSR), SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.