questions about MSVD
Closed this issue · 2 comments
xixiareone commented
The article mentions: "where they randomly chose 5 ground-truth sentences per video. We use the same setting when we compare with that approach". Does this mean the training, validation, and test sets each use 5 randomly chosen sentences per video?
Or, put differently, are not all sentences used in the training, validation, and test sets?
niluthpol commented
Five ground-truth sentences per video are used when comparing with LJRV [24] (LJRV picked five ground-truth descriptions per video). See Table 2 (Partition used by LJRV [24]).
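For illustration only, here is a minimal sketch of what picking five captions per video at random could look like. It is not the code from this repository or from LJRV: the function name `sample_captions`, the `captions_by_video` dictionary layout, and the fixed seed are all assumptions made for the example, and it does not say which splits the sampling is applied to.

```python
import random


def sample_captions(captions_by_video, k=5, seed=0):
    """Randomly pick up to k captions per video (hypothetical helper).

    captions_by_video: dict mapping a video id to a list of caption strings.
    This data layout is an assumption, not the repository's actual format.
    """
    rng = random.Random(seed)  # fixed seed so the subset is reproducible
    sampled = {}
    # Iterate in sorted order so the result does not depend on dict ordering.
    for video_id in sorted(captions_by_video):
        captions = captions_by_video[video_id]
        # Keep every caption when a video has fewer than k of them.
        sampled[video_id] = rng.sample(captions, min(k, len(captions)))
    return sampled


if __name__ == "__main__":
    toy = {
        "vid1": ["caption %d" % i for i in range(40)],
        "vid2": ["a man is cooking", "someone cooks", "a person stirs a pot"],
    }
    subset = sample_captions(toy, k=5)
    print({vid: len(caps) for vid, caps in subset.items()})
```

Fixing the seed keeps the chosen subset stable across runs, which matters if results on the sampled partition are to be compared.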
xixiareone commented
Can you share the code for MSVD? Thank you very much!