implementation detail

Question

Closed this issue 5 years ago · 1 comment

Hey @WingsBrokenAngel, nice work, and thanks for making the code publicly available for further research.

I have a question about the implementation. Since MSR-VTT has 20 captions for each video, how did you deal with them during training?
Did you take a random caption for each video in each epoch, or did you just repeat the features for each caption?
From looking at the implementation, I think you used all the captions with repeated features. Am I right?

Answer 1 · 2020-05-08T17:01:52.000Z

Yes, you are right.
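
For readers comparing the two options from the question, here is a minimal sketch of both. It is not the repository's code: the function names `expand_pairs` and `sample_pairs`, the dictionary layout, and the toy data are all hypothetical, chosen only to illustrate the idea. `expand_pairs` corresponds to the confirmed approach (every caption paired with the same repeated video features), and `sample_pairs` shows the alternative of drawing one random caption per video per epoch.

```python
import random


def expand_pairs(features, captions):
    """Pair every caption with its video's features (features are repeated)."""
    pairs = []
    for vid, feat in features.items():
        for cap in captions[vid]:
            # The same feature object is reused for each caption of the video.
            pairs.append((feat, cap))
    return pairs


def sample_pairs(features, captions, rng):
    """Alternative: pick one random caption per video (call once per epoch)."""
    return [(feat, rng.choice(captions[vid])) for vid, feat in features.items()]


# Hypothetical toy data: 2 videos with 2 captions each (MSR-VTT has 20 per video).
features = {"video0": [0.1, 0.2], "video1": [0.3, 0.4]}
captions = {
    "video0": ["a man is singing", "a person sings on stage"],
    "video1": ["a dog runs", "a dog is running in a park"],
}

all_pairs = expand_pairs(features, captions)
epoch_pairs = sample_pairs(features, captions, random.Random(0))

print(len(all_pairs))    # one training example per (video, caption) pair
print(len(epoch_pairs))  # one training example per video
```

With the first approach, each epoch sees every caption, so with 20 captions per video an epoch contains 20 times as many examples as there are videos. With the second, an epoch has one example per video and the captions vary between epochs.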