-
ReferIt3D: Neural Listeners for Fine-Grained 3D Object Identification in Real-World Scenes, Stanford University, ECCV 2020 Oral [Project] [Paper] [Code]
-
ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language, Technical University of Munich, ECCV 2020 [Project] [Paper] [Code]
-
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD Images, Shenzhen Research Institute of Big Data, CUHK-Shenzhen, CVPR 2021 [Project] [Paper] [Code]
-
Free-form Description Guided 3D Visual Graph Network for Object Grounding in Point Cloud, Xidian University, ICCV 2021 [Paper] [Code]
-
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring, The Chinese University of Hong Kong (Shenzhen), ICCV 2021 [Paper] [Code]
-
3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds, College of Software, Beihang University, ICCV 2021 [Paper] [Code]
-
SAT: 2D Semantics Assisted Training for 3D Visual Grounding, University of Rochester, ICCV 2021, Oral [Paper] [Code]
-
TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding, School of Computer Science and Engineering, Beihang University, ACM MM 2021 [Paper] [Code]
-
LanguageRefer: Spatial-Language Model for 3D Visual Grounding, University of Washington, CoRL 2021 [Paper] [Code]
-
3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection, Institute of Artificial Intelligence, Beihang University, CVPR 2022, Oral [Paper] [Code]
-
Multi-View Transformer for 3D Visual Grounding, The Chinese University of Hong Kong, CVPR 2022 [Paper] [Code]
-
Language Conditioned Spatial Relation Reasoning for 3D Object Grounding, Inria, École normale supérieure, CNRS, PSL Research University,, NeurIPS 2022 [Paper] [Code]
-
Look Around and Refer: 2D Synthetic Semantics Knowledge Distillation for 3D Visual Grounding, King Abdullah University of Science and Technology, NeurIPS 2022 [Paper] [Code]
-
EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding, Shenzhen Graduate School, Peking University, CVPR 2023 [Paper] [Code]
-
Language-Assisted 3D Feature Learning for Semantic Scene Understanding, Tsinghua University, AAAI 2023, Oral [Paper] [Code]
-
NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations, Stanford University, MIT, CVPR 2023 [Paper]
-
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding, Zhejiang University, ICCV 2023 [Paper]
-
ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance, Shanghai Artificial Intelligence Laboratory [Paper]
-
3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding, Zhejiang University [Paper]
-
A Unified Framework for 3D Point Cloud Visual Grounding, Xiamen University [Paper][Code]