jnhwkim/ban-vqa

Evaluating accuracy on test?

HuyRyu opened this issue · 2 comments

When I run python3 test.py --label mytest, i got a warning 'RNN module weights are not part of single contiguous chunk of memory. This means they need to be compacted at every call, possibly greately increasing memory usage. To compact weights again call flatten_parameters().' and code still complete but the result was evaluated on VQA challenge only 1% for overall. I use the your pretrained model and feature.

Warning message should not affect the result. I have checked the reproduction of the result, so I kindly recommend you to check the procedure. My hunch is your vocabulary cache is not matched with our pretrained model. Let me know if the issue is still persistent.

See the last section of this script.

It done, thank you very much.