/crawlingathome

A client library for Crawling@Home's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.

Primary LanguagePythonMIT LicenseMIT

No issues in this repository yet.