lyxok1/STM-Training

some questions

masonwang513 opened this issue · 3 comments

Why is it correct to accumulate the losses of several frames and then compute the gradient based only on the last frame?

According to the computation graph, the output mask prediction at every frame is included in the gradient computation, not just the one at the last frame.
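
To illustrate the point, here is a toy sketch in plain Python. It is not the repository's training code: the one-weight "model", the `frames` data, and every function name are hypothetical stand-ins. It only checks numerically that the gradient of a loss accumulated over several frames equals the sum of the per-frame gradients, so every frame contributes.

```python
# Toy stand-in for a video segmentation model: one shared weight `w`
# and one scalar "frame" per time step. All names are hypothetical.

def frame_loss(w, x, y):
    """Squared error of the prediction w * x against the target y."""
    return (w * x - y) ** 2

def frame_grad(w, x, y):
    """Analytic derivative of frame_loss with respect to w."""
    return 2.0 * (w * x - y) * x

def total_loss(w, frames):
    """Loss accumulated over all frames of a clip."""
    return sum(frame_loss(w, x, y) for x, y in frames)

def numeric_grad(f, w, eps=1e-6):
    """Central finite difference, used as an independent check."""
    return (f(w + eps) - f(w - eps)) / (2.0 * eps)

# Made-up (input, target) pairs standing in for three frames.
frames = [(1.0, 2.0), (2.0, 3.0), (3.0, 7.0)]
w = 0.5

per_frame = [frame_grad(w, x, y) for x, y in frames]
grad_total = numeric_grad(lambda v: total_loss(v, frames), w)

print("per-frame gradients:", per_frame)
print("sum of per-frame gradients:", sum(per_frame))
print("gradient of accumulated loss:", grad_total)
print("last-frame gradient only:", per_frame[-1])
```

In an autograd framework the same thing happens when the backward pass is run once on the summed loss: the graph still contains every per-frame term. The sketch leaves out the dependence of later frames on earlier predictions, which in a real model would add further gradient paths through the earlier frames.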

Hi, do you add the YouTube-VOS training set for training?

@shoutOutYangJie Yes, YouTube-VOS and DAVIS are combined to train the segmentation network, but no synthetic sequences generated from static images are used.