Pinned Repositories
ACmix
Official repository of ACmix (CVPR2022)
Agent-Attention
Official repository of Agent Attention (ECCV2024)
ARC
[ICCV 2023] Adaptive Rotated Convolution for Rotated Object Detection
DAT
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention
EfficientTrain
1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.
FLatten-Transformer
Official repository of FLatten Transformer (ICCV2023)
GSVA
[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models
MLLA
Official repository of MLLA (NeurIPS 2024)
Pseudo-Q
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
Slide-Transformer
Official repository of Slide-Transformer (CVPR2023)
LeapLabTHU's Repositories
LeapLabTHU/DAT
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention
LeapLabTHU/Agent-Attention
Official repository of Agent Attention (ECCV2024)
LeapLabTHU/FLatten-Transformer
Official repository of FLatten Transformer (ICCV2023)
LeapLabTHU/MLLA
Official repository of MLLA (NeurIPS 2024)
LeapLabTHU/EfficientTrain
1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.
LeapLabTHU/Slide-Transformer
Official repository of Slide-Transformer (CVPR2023)
LeapLabTHU/Pseudo-Q
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
LeapLabTHU/ARC
[ICCV 2023] Adaptive Rotated Convolution for Rotated Object Detection
LeapLabTHU/ExpeL
LeapLabTHU/GSVA
[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models
LeapLabTHU/AdaFocusV2
LeapLabTHU/Rank-DETR
[NeurIPS 2023] Rank-DETR for High Quality Object Detection
LeapLabTHU/ProCo
[TPAMI 2024] Probabilistic Contrastive Learning for Long-Tailed Visual Recognition
LeapLabTHU/Segment3D
LeapLabTHU/LAUDNet
[IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition
LeapLabTHU/Attention-Mediators
[ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
LeapLabTHU/Dynamic_Perceiver
Official implementation of Dynamic Perceiver
LeapLabTHU/ImprovedNAT
A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"
LeapLabTHU/FamO2O
Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)
LeapLabTHU/InLine
Official repository of InLine attention (NeurIPS 2024)
LeapLabTHU/L2W-DEN
[ECCV 2022] Learning to Weight Samples for Dynamic Early-exiting Networks
LeapLabTHU/AdaNAT
[ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
LeapLabTHU/Uni-AdaFocus
Official repository of Uni-AdaFocus (TPAMI 2024).
LeapLabTHU/LearnableISDA
[IEEE TIP] Fine-grained Recognition with Learnable Semantic Data Augmentation
LeapLabTHU/SimPro
[ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning
LeapLabTHU/OVM3D-Det
LeapLabTHU/ENAT
[NeurIPS 2024] ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
LeapLabTHU/DAT-Detection
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention
LeapLabTHU/UniTTA
LeapLabTHU/diver-ct