hekj
Ph.D. student at CASIA NLPR CRIPAC. Research interests include Machine Learning, Multimodality, and Embodied AI.
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Pinned Repositories
awesome-embodied-vision
Reading list for research topics in embodied vision
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
Awesome-Multimodal-Research
A curated list of Multimodal Related Research.
cvpr-latex-template
Extended LaTeX template for CVPR/ICCV papers
FDA
Official implementation of "Frequency-enhanced Data Augmentation for Vision-and-Language Navigation" (NeurIPS 2023)
Landmark-RxR
A human-annotated, fine-grained dataset for Vision-and-Language Navigation
Recurrent-VLN-BERT
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
RxR
Room-across-Room (RxR) is a large-scale, multilingual dataset for Vision-and-Language Navigation (VLN) in Matterport3D environments. It contains 126k navigation instructions in English, Hindi, and Telugu, and 126k navigation-following demonstrations. Both annotation types include dense spatiotemporal alignments between the text and the visual perceptions of the annotators.
VLN-BEVBert
[ICCV 2023] Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"
hekj's Repositories
hekj/FDA
Official implementation of "Frequency-enhanced Data Augmentation for Vision-and-Language Navigation" (NeurIPS 2023)
hekj/Landmark-RxR
A human-annotated, fine-grained dataset for Vision-and-Language Navigation
hekj/awesome-embodied-vision
Reading list for research topics in embodied vision
hekj/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
hekj/Awesome-Multimodal-Research
A curated list of Multimodal Related Research.
hekj/cvpr-latex-template
Extended LaTeX template for CVPR/ICCV papers
hekj/Recurrent-VLN-BERT
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
hekj/RxR
Room-across-Room (RxR) is a large-scale, multilingual dataset for Vision-and-Language Navigation (VLN) in Matterport3D environments. It contains 126k navigation instructions in English, Hindi, and Telugu, and 126k navigation-following demonstrations. Both annotation types include dense spatiotemporal alignments between the text and the visual perceptions of the annotators.
hekj/VLN-BEVBert
[ICCV 2023] Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"