Self-Distilled Internet Photos (SDIP) Dataset

Self-Distilled Flickr (SD-Flickr) Self-Distilled LSUN (SD-LSUN)
SD-Dogs SD-Bicycles
SD-Elephants SD-Horses

Self-Distilled Internet Photos (SDIP) is a multi-domain image dataset. The dataset consists of Self-Distilled Flickr (SD-Flickr) and Self-Distilled LSUN (SD-LSUN) that were crawled from Flickr and LSUN dataset, respectively, and then curated using the method described in our Self-Distilled StyleGAN paper:

Self-Distilled StyleGAN: Towards Generation from Internet Photos
Ron Mokady, Michal Yarom, Omer Tov, Oran Lang, Daniel Cohen-Or, Tali Dekel, Michal Irani, Inbar Mosseri https://arxiv.org/abs/2202.12211

Overview

StyleGAN’s fascinating generative and editing abilities are limited to structurally aligned and well-curated datasets. It does not work well on raw datasets downloaded from the Internet. The SDIP domains presented here, which are StyleGAN-friendly, were automatically curated by our method from raw images collected from the Internet. The raw uncurated images in Self-Distilled Flicker (SD-Flickr) were first crawled from Flickr using a simple keyword (e.g. 'dog' or 'elephant').

The dataset in this page exhibits 4 domains: SD-Dogs (126K images), SD-Elephants (39K images), SD-Bicycles (96K images), and SD-Horses (162K images). Our curation process consists of a simple pre-processing step (off-the-shelf object detector to crop the main object and then rescale), followed by a sophisticated StyleGAN-friendly filtering step (which removes outlier images while maintaining dataset diversity). This results in a more coherent and clean dataset, which is suitable for training a StyleGAN2 generator (see more details in our paper).

The data itself is saved in a json format: for SD-Flickr we provide urls of the original images and bounding boxes used for cropping; for SD-LSUN we provide image identifiers with the bounding boxes. In addition to the SDIP dataset, we also provide weights of pre-trained StyleGAN2 models trained using each image domain provided here.

Download

Script is coming up soon.

Self-Distilled Flickr (SD-Flickr)

Image Domains

We provide high-quality image collections for two domains curated from Flickr: 'Dogs' and 'Elephants'. Each image in SD-Flickr is given by a URL to the original image and a bounding box that indicates the crop we performed to obtain StyleGAN training data.

Domain Name File #Images Description
SD-Dogs ├  dogs.json 126K Metadata for SD-Dogs including URLs and object bounding boxes.
SD-Elephants ├  elephants.json 39K Metadata for SD-Elephants including URLs and object bounding boxes.

Pre-trained StyleGAN2 models

Coming up soon!

Self-Distilled LSUN (SD-LSUN)

Image Domains

We provide image collections for two LSUN domains: 'Horses' and 'Bicycles'. Each image in SD-LSUN is given by the image name, as appears in the LSUN dataset, and a bounding box that indicates the crop we performed to obtain StyleGAN training data. The LSUN images can be downloaded from here.

Domain Name File #Images Description
SD-Horses ├  horses.json 162K Metadata for SD-Horse including the names of the filtered images and object bounding boxes.
SD-Bicycles ├  bicycles.json 96K Metadata for SD-Bicycles including the names of the filtered images and object bounding boxes.

Pre-trained StyleGAN2 models

Coming up soon!

Citation

If you plan to use this dataset, or the published code, please cite it as:

@misc{mokady2022selfdistilled,
      title={Self-Distilled StyleGAN: Towards Generation from Internet Photos}, 
      author={Ron Mokady and Michal Yarom and Omer Tov and Oran Lang and Daniel Cohen-Or and Tali Dekel and Michal Irani and Inbar Mosseri},
      year={2022},
      eprint={2202.12211},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

If you are using LSUN images, please also follow the citation instructions on the LSUN website.

Licenses

The individual images were published in Flickr by their respective authors under either Creative Commons BY 2.0, Public Domain Mark 1.0, or Public Domain CC0 1.0. All of these licenses allow free use, redistribution, and adaptation. However, some of them require giving appropriate credit to the original author, as well as indicating any changes that were made to the images.

The datasets (including JSON metadata and documentation) and pre-trained models are made available under CC-BY-4.0. If you use the data or models, please give appropriate credit by citing our paper.

Privacy

When collecting the data, we were careful to only include photos that – to the best of our knowledge – were intended for free use and redistribution by their respective authors. That said, we are committed to protecting the privacy of individuals who do not wish their photos to be included. To get your photo removed from Contact self-distilled-stylegan@google.com. Please include the image URL in the mail.