VQA

Visual Question Answering project for the Object Recognition and Computer Vision course of the MVA master's program at ENS Paris-Saclay.

Here is the link to the 10-minute presentation given for the project.

References

  • Agrawal, A., Lu, J., Antol, S., Mitchell, M., Zitnick, C.L., Batra, D., and Parikh, D. VQA: Visual Question Answering. In ICCV, 2015.
  • VQA API: vqa_api directory.
  • Skipthoughts encoding: skip_thoughts directory.