google-research-datasets/wit
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
License: NOASSERTION
Issues
- 1: Article (or category) list of the dataset (#4 opened by shiv6891)
- 3: Baseline Models in WIT Paper (#9 opened by iamjanvijay)
- 1: release date of murals (#5 opened by LeeRock)
- 2: Actual size of the dataset & images (#1 opened by pabloppp)
- 4: What is the suggested way to download images (#6 opened by srg9000)
- 2: split tsv data error (#3 opened by QYinyourmind)