some questions

Question

masonwang513 opened this issue 5 years ago · 3 comments

why is it correct to accumulate losses of several frames and compute gradient just based on last frame?

Answer 1 · 2020-06-17T13:01:27.000Z

According to the computation graph, output mask prediction at each frame will be included into gradient computation, not just the last frame.

Answer 2 · 2021-03-22T02:25:54.000Z

Hi，Do you add youtube-vos train dataset for training?

Answer 3 · 2021-03-22T05:22:12.000Z

@shoutOutYangJie Yes, Youtube and DAVIS are hybrid to train the segmentation network, but no synthetic sequence from static image is used.