zhw-zhang

zhw-zhang's Stars

sczhou/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
Language:Python4.9k587
rese1f/StableVideo
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Language:Python1.3k85
qiuyu96/CoDeF
[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Language:Python4.8k390
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language:Jupyter Notebook4.5k291
JingyunLiang/VRT
VRT: A Video Restoration Transformer (official repository)
Language:Python1.3k122
mit-han-lab/fastcomposer
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Language:Python62134
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook9.2k906
alibaba/EasyNLP
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
Language:Python2k251
PeterWang512/GenDataAttribution
Evaluating Data Attribution for Text-to-Image Models: a visual data attribution benchmark for evaluating and learning training image influences.
Language:Python572
vvictoryuki/FreeDoM
[ICCV 2023] Official PyTorch implementation for the paper "FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model"
Language:Python2519
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Language:Python9.8k1.5k
hila-chefer/Transformer-Explainability
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
Language:Jupyter Notebook1.7k227
omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
Language:Python1.5k134
zsyOAOA/ResShift
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS 2023 Spotlight)
Language:Python63635
ali-vilab/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
Language:Python3.8k359
ssundaram21/dreamsim
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight)
Language:Python32016
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python11.8k1k
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Language:Python9.7k794
text2cinemagraph/text2cinemagraph
Text2Cinemagraph: Text-Guided Synthesis of Eulerian Cinemagraphs [SIGGRAPH ASIA 2023]
Language:Python35443
ytdl-org/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
Language:Python130k9.8k
kabachuha/sd-webui-text2video
Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies
Language:Python1.3k109
ShuyUSTC/VidGenMetrics
Quantitative evaluation metrics for video generation
Language:Python2
wilson1yan/VideoGPT
Language:Jupyter Notebook940107
pfnet-research/tgan2
The official implementation of "Train Sparsely, Generate Densely: Memory-efficient Unsupervised Training of High-resolution Temporal GAN"
Language:Python7611
voletiv/mcvd-pytorch
Official implementation of MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation (https://arxiv.org/abs/2205.09853)
Language:Python31125
MKFMIKU/vidm
[AAAI23 Oral] Official implementations of Video Implicit Diffusion Models
Language:Python654
universome/stylegan-v
[CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
Language:Python33335
NVlabs/long-video-gan
Official PyTorch implementation of LongVideoGAN
Language:Python30325
Picsart-AI-Research/VideoINR-Continuous-Space-Time-Super-Resolution
[CVPR 2022] VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution
Language:Python26127
AaronFeng753/Waifu2x-Extension-GUI
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.
Language:C++12.4k856