Pinned Repositories
BLIVA
(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
BoundaryFormer
Code for CVPR 2022 paper: Instance Segmentation with Mask-supervised Polygonal Boundary Transformers
CoaT
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
DC-VAE
(CVPR 2021) DC-VAE: Dual Contradistinctive Generative Autoencoder
LETR
(CVPR 2021 Oral) LETR: Line Segment Detection Using Transformers without Edges
MaskCLIP
Code Release for MaskCLIP (ICML 2023)
PRTR
(CVPR 2021) PRTR: Pose Recognition with Cascade Transformers
TESTR
(CVPR 2022) Text Spotting Transformers
TokenCompose
(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision
ViTGAN
mlpc-ucsd's Repositories
mlpc-ucsd/BLIVA
(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
mlpc-ucsd/CoaT
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
mlpc-ucsd/LETR
(CVPR 2021 Oral) LETR: Line Segment Detection Using Transformers without Edges
mlpc-ucsd/TESTR
(CVPR 2022) Text Spotting Transformers
mlpc-ucsd/PRTR
(CVPR 2021) PRTR: Pose Recognition with Cascade Transformers
mlpc-ucsd/TokenCompose
(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision
mlpc-ucsd/BoundaryFormer
Code for CVPR 2022 paper: Instance Segmentation with Mask-supervised Polygonal Boundary Transformers
mlpc-ucsd/MaskCLIP
Code Release for MaskCLIP (ICML 2023)
mlpc-ucsd/ViTGAN
mlpc-ucsd/DC-VAE
(CVPR 2021) DC-VAE: Dual Contradistinctive Generative Autoencoder
mlpc-ucsd/Patch-DM
Code Release for Patch-DM (ICLR 2024)
mlpc-ucsd/MasQCLIP
(ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation
mlpc-ucsd/Guided-VAE
(CVPR 2020) Guided-VAE: Guided Variational Autoencoder for Disentanglement Learning
mlpc-ucsd/BDM
(CVPR 2024) Bayesian Diffusion Models for 3D Shape Reconstruction
mlpc-ucsd/Uni-3D
(ICCV 2023) Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction
mlpc-ucsd/BERT_Convolutions
(ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models
mlpc-ucsd/XTRA
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
mlpc-ucsd/ConstellationNet
(ICLR 2021) ConstellationNet: Attentional Constellation Nets for Few-Shot Learning