Opened this issue 6 years ago · 1 comments
How can we apply this API on the MSCOCO dataset? In MNIST dataset, we have one label corresponding to each image but in MSCOCO, each image has multiple labels (categories) or multiple captions. How to modify the network in this case?
I'm interested on this too.