Pinned Repositories
MagicDrive
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
visual_token_matching
[ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching
glee-vision.github.io
Sparse4D
RYHSmmc.github.io
APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
DatasetDM
[NeurIPS2023] DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
MotionDirector
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
SED
[CVPR2024] Official PyTorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.