img2dataset
There are 2 repositories under img2dataset topic.
nopperl/clip_arxiv_pmc
Training CLIP models on Data from Scientific Papers
svjack/img2dataset-pq2hf-transform-toolkit
A simple toolkit to transform datasource generate by img2dataset from parquet file to Huggingface dataset.