/CrossVLT

Cross-aware Early Fusion with Stage-divided Vision and Language Transformer Encoders for Referring Image Segmentation (Published in IEEE TMM 2023)

Primary LanguagePythonMIT LicenseMIT

No issues in this repository yet.