- Mihir Prabhudesai, Hsiao-Yu Fish Tung, Syed Ashar Javed, et al. Embodied Language Grounding With 3D Visual Feature Representations. [Paper] [Code] - Nuri
- Jialian Wu, Chunluan Zhou, Ming Yang, et al. Temporal-Context Enhanced Detection of Heavily Occluded Pedestrians. [Paper] - Kyungdo