streamlit run streamlit_app.py
To serve previews from the local dataset copy, please enable static file serving:
export STREAMLIT_SERVER_ENABLE_STATIC_SERVING=1
All datasets share identical format:
images.txt
contains newline-delimited URLs or Base64-encoded data-URIs of images.images.fbin
contains a binary matrix of UForm embedding for every image fromimages.txt
.images.usearch
contains a binary USearch search index for fast kANN.
Additionally, some image-text paired datasets may provide texts.txt
, texts.fbin
, texts.usearc
, following the same logic.
wget -O images.txt https://huggingface.co/datasets/unum-cloud/gallery-unsplash25k/resolve/main/images.txt
wget -O images.fbin https://huggingface.co/datasets/unum-cloud/gallery-unsplash25k/resolve/main/images.fbin
wget -O images.usearch https://huggingface.co/datasets/unum-cloud/gallery-unsplash25k/resolve/main/images.usearch
wget -O images.txt https://huggingface.co/datasets/unum-cloud/gallery-cc4m/resolve/main/images.txt
wget -O images.fbin https://huggingface.co/datasets/unum-cloud/gallery-cc4m/resolve/main/images.fbin
wget -O images.usearch https://huggingface.co/datasets/unum-cloud/gallery-cc4m/resolve/main/images.usearch
wget -O images.txt https://huggingface.co/datasets/unum-cloud/gallery-laion400m/resolve/main/images.txt
wget -O images.fbin https://huggingface.co/datasets/unum-cloud/gallery-laion400m/resolve/main/images.fbin
wget -O images.usearch https://huggingface.co/datasets/unum-cloud/gallery-laion400m/resolve/main/images.usearch