이미지에 자연어로 묻고 그에 대한 답을 얻어내는 모바일 앱 제작
- React Native
- GCP (GPU)
- Pytorch
- Akira Fukui, Dong Huk Park, Daylen Yang, Anna Rohrbach, Trevor Darrell, & Marcus Rohrbach (2016). Multimodal Compact Billinear Pooling for Visual Question Answering and Visual Grounding.
Paper Link: https://arxiv.org/pdf/1606.01847.pdf