Pinned Repositories
mlc-imp
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
activitynet-qa
An VideoQA dataset based on the videos from ActivityNet
bottom-up-attention.pytorch
A PyTorch reimplementation of bottom-up-attention models
imp
a family of highly capabale yet efficient large multimodal models
mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
mmnas
Deep Multimodal Neural Architecture Search
openvqa
A lightweight, scalable, and general framework for visual question answering research
prophet
Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".
rosita
ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
xmchat
MIL-VLG's Repositories
MIL-VLG/mlc-imp
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.