LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
Primary LanguageJupyter NotebookMIT LicenseMIT