uclanlp/visualbert
Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"
Python
Issues
- 0
How to retrain VisualBERT on another dataset?
#46 opened by Tess314 - 0
Visualbert VQA model inference lower accuracy in validation around 40% by huggingface framework
#45 opened by guanhdrmq - 0
Minimum GPU requirement
#44 opened by nowhash - 0
- 0
- 0
VisualBERT with Detectron2
#38 opened by smfsamir - 0
- 6
Extracting image features for VQA
#10 opened by johntiger1 - 8
Extracting Detectron Features
#1 opened by sanjayss34 - 2
COCO features
#22 opened by qinzzz - 0
- 7
allennlp 0.8.0 .
#20 opened by alice-cool - 0
checkpoints for flickr30k?
#35 opened by ziyanyang - 0
- 1
about chinese
#32 opened by Soulscb - 1
Features vqacoco-pre-train
#31 opened by RitaRamo - 1
Bert-large
#27 opened by 1144181135 - 0
Contact author
#33 opened by zxy-in - 1
About evaluation
#30 opened by renmada - 0
Process finished with exit code 137
#29 opened by lifebl - 1
How to make evaluation on VQA?
#5 opened by yangapku - 4
- 1
Flickr30k entities support
#6 opened by fbrad - 1
about MCAN
#19 opened by ckj1221 - 1
sentence image matching
#21 opened by goodbai2 - 1
Visual Features Computation
#24 opened by michelecafagna26 - 1
seq_relationship_score logits order
#26 opened by michelecafagna26 - 1
Question about visualBERT
#8 opened by YuBeomGon - 1
- 1
COCO pre-training size
#17 opened by e-bug - 0
有大佬开源个Keras版本的吗?
#23 opened by fengxin619 - 1
How to Generate Visual Attention Maps
#16 opened by g-luo - 1
- 2
Using visualBERT for generation
#14 opened by nishanthcgit - 2
Pre-training on other BERT models
#12 opened by Muennighoff - 1
Number of ROIs
#11 opened by e-bug - 2
- 2
Flickr30k Entities fine-tuning clarification
#9 opened by tonyduan - 1
"pre-training" section in the readme
#7 opened by johntiger1 - 2
config_vcr is no where to be found
#3 opened by ChenghaoMou