Question on reproduction

Question

Question on reproduction

Closed this issue 4 years ago · 15 comments

Thank you for sharing your work. I cannot reproduce the results of the paper in practice. May I ask how many epochs were trained during training and how much the final loss converged?

Answer 1 · 2020-09-28T02:18:43.000Z

Do you use the 6-reference test set? 4 epochs during training and nll_loss on validation set is about 1.62.

Answer 2 · 2020-09-28T04:11:52.000Z

I follow the default setting for training the model.

This is the training process.

This is the evaluation. The model output the bad response.

Answer 3 · 2020-09-28T04:34:45.000Z

I think the model is trained enough by myself, maybe there is something to pay attention to when testing? My beam search is set to 5. I am so confused about the output when testing.

Answer 4 · 2020-09-28T07:21:35.000Z

when you are training and testing, what feature do you use?

Answer 5 · 2020-09-28T07:24:43.000Z

Just your default setting('vggish', 'i3d_flow', 'i3d_rgb')

Answer 6 · 2020-09-28T07:27:34.000Z

how about text features? do you use the caption, summary, and dialogue?

Answer 7 · 2020-09-28T07:40:19.000Z

I haven't modified the code yet, just use your original settings, so the text features should be used according to your original code.

Answer 8 · 2020-09-28T08:12:56.000Z

Sorry, I check the code and find a bug in VideoGPT2.py. I think I uploaded the wrong version before. Now it will be the right version and you can try again. Hope you can get the right results. If there are still any problems, please don't hesitate to ask me.

Answer 9 · 2020-09-28T08:18:07.000Z

Thank you for your patient reply, I will try again.

Answer 10 · 2020-09-28T08:20:28.000Z

When testing, you can use the 6-ref test set to compute the metrics.

Answer 11 · 2020-09-30T06:53:25.000Z

Hello, I modified the vidioGPT.py file as you did, but after the modification, after one epoch during my training, the loss no longer drops. The model cannot converge completely. All outputs are tag when testing.

Answer 12 · 2020-09-30T07:00:32.000Z

Sorry about that, I will rerun it at once. I will check it again. Because of something fusion with other projects, the version may be in disorder. I will inform you at once when I find out why.

Answer 13 · 2020-09-30T09:06:12.000Z

Ok, Thanks.

Answer 14 · 2020-10-02T02:30:56.000Z

I fixed the bug. You can try again. It should be ok. Be sure that using the default settings, including batch_size, learning rate, etc. The nll loss can be about 1.62.

Answer 15 · 2020-10-04T05:50:38.000Z

Thanks, I can get a reasonable result.