Pinned Repositories
mmgnn_textvqa
A Pytorch implementation of CVPR 2020 paper: Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text
CLVQA
[AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task
env-qa
envqa.github.io
gui_parser
mmgnn_textvqa
A Pytorch implementation of CVPR 2020 paper: Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text
openai-quickstart-python
Python example app from the OpenAI API quickstart tutorial
assistgpt
Awesome-GUI-Agent
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
computer_use_ootb
An out-of-the-box (OOTB) version of Anthropic Claude Computer Use for Windows and macOS
maybelu9's Repositories
maybelu9/env-qa
maybelu9/CLVQA
[AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task
maybelu9/envqa.github.io
maybelu9/gui_parser
maybelu9/mmgnn_textvqa
A Pytorch implementation of CVPR 2020 paper: Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text
maybelu9/openai-quickstart-python
Python example app from the OpenAI API quickstart tutorial