This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"
π₯π₯π₯Coming Soonπ₯π₯π₯
-
Download A2D-Sentences and JHMDB-Sentences.
-
Please use RAFT to generate the opticla flow map for each frame.
-
Put them as follows:
your dataset dir/
βββ A2D/
βββ allframes/
βββ allframes_flow/
βββ Annotations_visualize
βββ a2d_txt
βββtrain.txt
βββtest.txt
βββ J-HMDB/
βββ allframes/
βββ allframes_flow/
βββ Annotations_visualize
βββ jhmdb_txt
βββtrain.txt
βββtest.txt
"Annotations_visualize" contains the GT masks for each target object. We have upload them to BaiduPan(lo50) for convenience.
Comming Soon
Comming Soon
Please consider citing our work in your publications if you are interest in our research:
Comming Soon