Model: Visual Question Answering with React Native

Primary Language: JavaScript

VQA: Visual Question Answering

A mobile app that lets you ask a natural-language question about an image and get an answer.

vqa_phone

Powered by

  • React Native
  • GCP (GPU)
  • PyTorch

Reference Paper

  • Akira Fukui, Dong Huk Park, Daylen Yang, Anna Rohrbach, Trevor Darrell, & Marcus Rohrbach (2016). Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding.

Paper Link: https://arxiv.org/pdf/1606.01847.pdf

vqa_model