Pinned Repositories
ARKseal.github.io
Beyond-Nature
CLIP
Contrastive Language-Image Pretraining
crawlingathome
A client library for Crawling@Home's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
crawlingathome-fileserver
A server used for storing CPU workers' uploads, ready for GPU workers to complete.
crawlingathome-gpu-hcloud
GPU controlled Hetzner Cloud workers swarm for Crawling@Home project
crawlingathome-gpu-kaggle
crawlingathome-server
A server powering Crawling@Home's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
crawlingathome-worker
webdataset-utils
arkseal's Repositories
arkseal/crawlingathome-worker
arkseal/webdataset-utils
arkseal/ARKseal.github.io
arkseal/Beyond-Nature
arkseal/CLIP
Contrastive Language-Image Pretraining
arkseal/crawlingathome
A client library for Crawling@Home's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
arkseal/crawlingathome-fileserver
A server used for storing CPU workers' uploads, ready for GPU workers to complete.
arkseal/crawlingathome-gpu-hcloud
GPU controlled Hetzner Cloud workers swarm for Crawling@Home project
arkseal/crawlingathome-gpu-kaggle
arkseal/crawlingathome-server
A server powering Crawling@Home's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
arkseal/crawlingathome-wheels
The package wheels for the Crawling At Home project
arkseal/firebase-test
This is a test for firebase hosting
arkseal/Get_On_Bot
arkseal/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
arkseal/LAION-5B-WatermarkDetection
arkseal/ohyeah
arkseal/shshacks2022
arkseal/SmartDataset
A Dataset, made to be smart and efficient :)
arkseal/watermark-detection
A repository containing datasets and tools to train a watermark classifier.
arkseal/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.