Pinned Repositories
airbert-recurrentvln
LaBERT
A length-controllable and non-autoregressive image captioning model.
language-planner
Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
PREVALENT_R2R-1
Applies the PREVALENT pretrained model to the R2R task
Recurrent-VLN-BERT
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
ScaleVLN
[ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation
video-mamba-suite
VideoMamba
VideoMamba: State Space Model for Efficient Video Understanding
VLN-HAMT
VMamba
VMamba: Visual State Space Models; code is based on Mamba