Vocabulary and single image-question pair prediction
foxm79 opened this issue · 2 comments
foxm79 commented
- Is the vocabulary that maps the words of a question to 'input_ids' available?
- Is there a function that does this conversion for an input question?
- Is there code that takes a single image-question pair and predicts the answer?
tjulyz commented
- Refer to prepro.py in scripts.
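
For illustration, the question-to-'input_ids' step could look roughly like the sketch below. This is not the contents of prepro.py: the function names (`tokenize`, `build_vocab`, `encode_question`), the `<pad>`/`<unk>` tokens, and the `max_len` default are all assumptions made for the example, and the repository's actual tokenization and vocabulary may differ.

```python
import re

# Hypothetical special tokens; the real vocabulary may use different ones.
PAD, UNK = "<pad>", "<unk>"


def tokenize(question):
    """Lowercase the question and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", question.lower())


def build_vocab(questions, min_count=1):
    """Map every word seen at least min_count times to an integer id."""
    counts = {}
    for q in questions:
        for tok in tokenize(q):
            counts[tok] = counts.get(tok, 0) + 1
    vocab = {PAD: 0, UNK: 1}
    # Sort the words so the id assignment is deterministic.
    for tok in sorted(counts):
        if counts[tok] >= min_count:
            vocab[tok] = len(vocab)
    return vocab


def encode_question(question, vocab, max_len=14):
    """Convert a question to a fixed-length list of ids (truncate, then pad)."""
    ids = [vocab.get(tok, vocab[UNK]) for tok in tokenize(question)][:max_len]
    return ids + [vocab[PAD]] * (max_len - len(ids))


if __name__ == "__main__":
    vocab = build_vocab(["What color is the cat?", "Is the cat sleeping?"])
    print(encode_question("What color is the dog?", vocab, max_len=6))
```

Words that are not in the vocabulary (here "dog") fall back to the `<unk>` id, and short questions are padded with the `<pad>` id up to `max_len`. For a single image-question pair, the resulting ids would be passed to the model together with the image features, using whichever vocabulary the model was trained with.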
foxm79 commented
Yes, that is what I ended up following. Thanks for replying!