zengyan-97/X-VLM

X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)

PythonBSD-3-Clause

Issues

The Little Daisy Bake Shop - New Website
#34 opened 2 months ago by madelinedefrank
0
VQA: Understanding how the model provides us an answer? Need of answer list?
#24 opened 2 years ago by fizahkhalid
1
Dear Author，Is there any inference code for image retrieval？How can I use this project to inference on my own image-text pairs.
#27 opened 2 years ago by wildwolff
1
An error in the `Retrieval.py`
#33 opened a year ago by jiajinuiuc
0
The torch version out of date
#32 opened a year ago by Hoang-it
0
About training data
#30 opened a year ago by 1049451037
1
Will data leakage happen for bounding box prediction?
#31 opened a year ago by 1049451037
0
Loading frompretrained warnining
#29 opened a year ago by lezhang7
0
Where is the pretrained model's config file?
#28 opened a year ago by lezhang7
0
apply an entire BERT as text encoder
#26 opened 2 years ago by lxianl455
1
VQA: Limitations in questions and answers
#25 opened 2 years ago by fizahkhalid
1
Drawing Attention Heatmap
#11 opened 2 years ago by TheodorPatrickZ
1
Code for Grad-CAM visualization
#23 opened 2 years ago by qiaomu-miao
2
Script to generate RegionTextJsonDataset?
#13 opened 2 years ago by daizuozhuo
4
About swin_B_480
#16 opened 2 years ago by Sxx1995
1
Fine-tune on VQA
#17 opened 2 years ago by darwann
2
Performance of different vision encoders
#18 opened 2 years ago by AI-in-Health
1
About batch sampling `iter_perc`
#19 opened 2 years ago by yangbang18
1
NLVR Pretrain
#21 opened 2 years ago by lonestar234028
1
The code saves the best testing results on Image-Text Retrieval
#22 opened 2 years ago by yangbang18
0
Finetuning On NLVR2
#20 opened 2 years ago by lonestar234028
1
inferece api for referring expression comprehension
#15 opened 2 years ago by zzh-tech
0
add web demo/models/datasets to ICML organization on Hugging Face
#14 opened 2 years ago by AK391
0
Training log for the pretrain stage
#12 opened 2 years ago by tgxs002
2
Could you provide your training logs of coco caption? Thank you very much!
#6 opened 2 years ago by pypypypy666
1
pretrain-base-4m for the X_VLM
#9 opened 2 years ago by wfx0330
1
Fine-tuning
#10 opened 2 years ago by TheodorPatrickZ
1
About license
#8 opened 2 years ago by WangWenhao0716
2
Distributed mode for single GPU
#7 opened 2 years ago by TheodorPatrickZ
2
Hello, please ask the train_files in configs/yaml file to "hdfs: // path / to / vg" error, please change how to set up
#3 opened 3 years ago by zhanghehe8
6
Custom image inference
#5 opened 3 years ago by SangMyeongWoh
0
Hi, could you provide the specific commands of finetuning on coco captioning? Thanks!
#4 opened 3 years ago by yaolinli
1
什么时候开源啊
#1 opened 3 years ago by cdqncn
5
Great project, extremely looking for the releasing of the code!
#2 opened 3 years ago by HenryHZY
1