Pinned Repositories
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
DragAnything
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
Image2Paragraph
[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
MotionDirector
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
Show-1
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
VideoSwap
Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
VLog
Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
X-Adapter
[CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
Show Lab's Repositories
showlab/Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
showlab/Show-1
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
showlab/Image2Paragraph
[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
showlab/VLog
Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
showlab/DatasetDM
[NeurIPS2023] DatasetDM:Synthesizing Data with Perception Annotations Using Diffusion Models
showlab/all-in-one
[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
showlab/DeVRF
The Pytorch implementation of "DeVRF: Fast Deformable Voxel Radiance Fields for Dynamic Scenes"
showlab/ShowAnything
showlab/loveu-tgve-2023
Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.
showlab/assistgpt
showlab/datacentric.vlp
Compress conventional Vision-Language Pre-training data
showlab/Region_Learner
The Pytorch implementation for "Video-Text Pre-training with Learned Regions"
showlab/mist
showlab/Awesome-Long-Context
A curated list of resources about long-context in large-language models and video understanding.
showlab/ShowRoom3D
This is the project page of ShowRoom3D
showlab/Q2A
[ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant
showlab/Efficient-CLS
[ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video
showlab/GEB-Plus
[ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval
showlab/HOSNeRF
This is the project page for the HOSNeRF
showlab/AVA-AVD
showlab/Show-Anything-3D
Edit and Generate Anything in 3D world!
showlab/SCT
[IJCV2023] Offical implementation of "SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels"
showlab/SOIS
The Pytorch implementation of "Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization"
showlab/ColonNeRF
This is the project page for ColonNeRF.
showlab/DynVideo-E
This is the project page for DynVideo-E.
showlab/TTC-Tuning
Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm
showlab/xagen
showlab/pv3d
showlab/Mix-of-Show
showlab/Moonshot