Pinned Repositories
AdaMixer
[CVPR 2022 Oral] AdaMixer: A Fast-Converging Query-Based Object Detector
CamLiFlow
[CVPR 2022 Oral & TPAMI 2023] Learning Optical Flow and Scene Flow with Bidirectional Camera-LiDAR Fusion
EMA-VFI
[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation
MixFormer
[CVPR 2022 Oral & TPAMI 2024] MixFormer: End-to-End Tracking with Iterative Mixed Attention
MixFormerV2
[NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking
MOC-Detector
[ECCV 2020] Actions as Moving Points
SparseBEV
[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos
SparseOcc
[ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric
TDN
[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Repositories of the Multimedia Computing Group, Nanjing University
MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
MCG-NJU/MixFormer
[CVPR 2022 Oral & TPAMI 2024] MixFormer: End-to-End Tracking with Iterative Mixed Attention
MCG-NJU/SparseBEV
[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos
MCG-NJU/SparseOcc
[ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric
MCG-NJU/CamLiFlow
[CVPR 2022 Oral & TPAMI 2023] Learning Optical Flow and Scene Flow with Bidirectional Camera-LiDAR Fusion
MCG-NJU/MeMOTR
[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking
MCG-NJU/MixFormerV2
[NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking
MCG-NJU/MOTIP
Multiple Object Tracking as ID Prediction
MCG-NJU/LinK
[CVPR 2023] LinK: Linear Kernel for LiDAR-based 3D Perception
MCG-NJU/AWT
[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
MCG-NJU/SGM-VFI
[CVPR 2024] Sparse Global Matching for Video Frame Interpolation with Large Motion
MCG-NJU/BIVDiff
[CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
MCG-NJU/VFIMamba
[NeurIPS 2024] VFIMamba: Video Frame Interpolation with State Space Models
MCG-NJU/PointTAD
[NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points
MCG-NJU/CoMAE
[AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets
MCG-NJU/DEQDet
[ICCV 2023] Deep Equilibrium Object Detection
MCG-NJU/MGMAE
[ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding
MCG-NJU/Dynamic-MDETR
[TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
MCG-NJU/SPLAM
[ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model
MCG-NJU/ZeroI2V
[ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
MCG-NJU/AMD
[CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models
MCG-NJU/SportsHHI
[CVPR 2024] SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos
MCG-NJU/VLG
VLG: General Video Recognition with Web Textual Knowledge (https://arxiv.org/abs/2212.01638)
MCG-NJU/StageInteractor
[ICCV 2023] StageInteractor: Query-based Object Detector with Cross-stage Interaction
MCG-NJU/ProVP
[IJCV] Progressive Visual Prompt Learning with Contrastive Feature Re-formation
MCG-NJU/ViT-TAD
[CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos
MCG-NJU/DGN
[IJCV 2023] Dual Graph Networks for Pose Estimation in Crowded Scenes
MCG-NJU/PRVG
[CVIU 2024] End-to-end dense video grounding via parallel regression
MCG-NJU/VideoEval
VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Models
MCG-NJU/LogN
[IJCV 2024] Logit Normalization for Long-Tail Object Detection