google-research-datasets/conceptual-12m
Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.
NOASSERTION
Issues
- 0
Query about Fine-tuning
#7 opened by Saloni0512 - 3
Lots of links are not working?
#4 opened by yxchng - 0
Hashcodes don't match
#6 opened by MikeyShechter - 0
Original alt-text
#5 opened by nicolas-dufour - 1
Image-captioning pre-trained model
#3 opened by JohannesTK - 2
The overlap between CC3m and CC12m
#2 opened by weiyx16 - 1
image download script
#1 opened by ShoufaChen