Pinned Repositories
aopolin-lv.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
ECSpell
pic_mgr
Manager of picture
RoboMP2
RoboMP2.github.io
VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
VSENet
Data and code for Visual Subtitle Feature Enhanced Video Outline Generation
Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
RLBench
A large-scale benchmark and learning environment.
VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
aopolin-lv's Repositories
aopolin-lv/ECSpell
aopolin-lv/VSENet
Data and code for Visual Subtitle Feature Enhanced Video Outline Generation
aopolin-lv/RoboMP2
aopolin-lv/aopolin-lv.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
aopolin-lv/pic_mgr
Manager of picture
aopolin-lv/RoboMP2.github.io
aopolin-lv/VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
aopolin-lv/OCRM_survey
A Survey of Embodied Learning for Object-Centric Robotic Manipulation