Pinned Repositories
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
DragAnything
Official code for 'DragAnything: Motion Control for Anything using Entity Representation'
Image2Paragraph
[A toolbox for fun.] Transform an image into a unique paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
MotionDirector
MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
Show-1
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
UniVTG
[ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding
VideoSwap
Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
VLog
Transform a video into a document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
X-Adapter
[CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
Show Lab's Repositories
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
showlab/MotionDirector
MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
showlab/X-Adapter
[CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
showlab/DragAnything
Official code for 'DragAnything: Motion Control for Anything using Entity Representation'
showlab/VideoSwap
Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
showlab/UniVTG
[ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding
showlab/Awesome-MLLM-Hallucination
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
showlab/BoxDiff
[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
showlab/EgoVLP
[NeurIPS 2022] Egocentric Video-Language Pretraining
showlab/VisorGPT
[NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT
showlab/T2VScore
T2VScore: Towards A Better Metric for Text-to-Video Generation
showlab/cosmo
showlab/sparseformer
[ICLR 2024, CVPR 2024] SparseFormer
showlab/CLVQA
[AAAI 2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)
showlab/ShowRoom3D
This is the project page of ShowRoom3D
showlab/Long-form-Video-Prior
showlab/Efficient-CLS
[ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video
showlab/BYOC
[IEEE-VR 2024] Bring Your Own Character: A Holistic Solution for Automatic Facial Animation Generation of Customized Characters
showlab/assistgui
showlab/LOVA3
The official repo of "Learning to Visual Question Answering, Asking and Assessment"
showlab/VisInContext
Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning
showlab/RingID
showlab/Tune-An-Ellipse
[CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want
showlab/DynVideo-E
This is the project page for DynVideo-E.
showlab/magicanimate
showlab/cvpr2024-tutorial-video-diffusion-models
showlab/AssistGaze
showlab/GUI-Action-Narrator
Repository of GUI Action Narrator
showlab/Moonshot
showlab/videogui