/DMFormer

[IEEE TCSVT] Decoupled Multimodal Transformers (DMFormer) for Referring Video Object Segmentation

Primary Language: Python