henghuiding/Vision-Language-Transformer
[ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation
PythonMIT
Issues
- 2
Test with own dataset
#18 opened by wooj0216 - 0
swin transformer 做视觉特征提取器
#17 opened by liujing0814 - 0
Reproducing on refcocog
#16 opened by freshpearYoon - 0
how to resume training from previous epoch?
#15 opened by freshpearYoon - 0
question about loader.py
#14 opened by freshpearYoon - 2
- 0
- 9
- 0
code about contrastive learning
#11 opened by Chic-J - 0
- 0
Confusion about data_process_v2
#8 opened by huangjy-pku - 4
- 1
How to inference on my own image and text?
#7 opened by kelisiya - 2
About training speed.
#4 opened by chaoqunwangcs - 3
How to perform Inference?
#5 opened by YunLongPan - 2
How to train on the custom data set?
#3 opened by YunLongPan - 1
Two questions about the model training: weight mismatch and yolov3_480000.h5
#1 opened by jianhua2022