microsoft/BridgeTower

Open source code for AAAI 2023 Paper "BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning"

Python · MIT

Issues

finetuning using both itm & itc using huggingface implementation
#15 opened 5 months ago by aretius
3
Processor only accepts 3 channel images
#12 opened 9 months ago by swtb3
1
Checkpoint link is broken
#14 opened 8 months ago by sukeey
1
processor support for single channel?
#13 opened 9 months ago by swtb3
0
about code
#2 opened 2 years ago by zhenjia2017
2
Pretraining Result of BridgeTower
#4 opened 2 years ago by jsg921019
1
Few questions about the implementation of BridgeTower
#3 opened 2 years ago by jsg921019
4
Pre-trained visual encoder, 4M pretraining?
#1 opened 3 years ago by m-bain
5