google-research-datasets/conceptual-12m

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

License: NOASSERTION

Issues

Query about Fine-tuning
#7 opened 3 months ago by Saloni0512
0
Lots of links are not working?
#4 opened 2 years ago by yxchng
3
Hashcodes don't match
#6 opened a year ago by MikeyShechter
0
Original alt-text
#5 opened a year ago by nicolas-dufour
0
Image-captioning pre-trained model
#3 opened 4 years ago by JohannesTK
1
The overlap between CC3m and CC12m
#2 opened 4 years ago by weiyx16
2
image download script
#1 opened 4 years ago by ShoufaChen
1