[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
- Asterisci: Xi’an Jiaotong University
- chaoyivision: The University of Sydney (USYD)
- Chuan-shanjia
- CurryYuan: The Chinese University of Hong Kong, Shenzhen
- cwhao98
- daveredrum: Huawei Noah's Ark Lab
- fly51fly: PRIS
- FMCalisto: Institute for Systems and Robotics
- ggsonic
- heng-hw
- JiayuXu829
- KESHEN-ZHOU: University of Sydney
- kevinszjmm: Shanghai
- linhaojia13: Xiamen University
- luomingshuang: ICT, UCAS, Peng Cheng Lab
- Nightmare-n: Zhejiang University
- qiruiw: Vancouver
- Samir55: KAUST
- Sekunde: Generative AI, Meta
- sunanhe
- sxpmwmh
- Tomato1107: Okayama University
- vice-jin: USST
- W1zheng
- Worker789
- Xiaolong-RRL: PKU
- yangpanquan: Xidian University
- yanx27: The Chinese University of Hong Kong, Shenzhen
- yh-he: Hangzhou
- yinyunie: Technical University of Munich
- youquanl: Hochschule Bremerhaven
- yuchenlichuck: Sony AI
- zehanwang01
- Zhang-Jing-Xuan: East China University of Science and Technology
- zhangjb416
- zhouhuanly