This repo is specifically written to utilise Google's search by image function and download visually similar images. This repo might be useful in collecting images that are visually similar to an image URL.
- Activate the virtual environment
seleniumVenv\Scripts\activatein Windowssource seleniumVenv/bin/activatein Linux
- Download the required packages
pip install -r requirements.txt - Make sure you have the correct version of Selenium in
driversfolder. For more information, refer to Selenium Official Doc. - Start using!
python searchByImage.py "https://upload.wikimedia.org/wikipedia/commons/thumb/9/9e/Giraffe_Mikumi_National_Park.jpg/800px-Giraffe_Mikumi_National_Park.jpg" 5Show help:
$ python searchByImage.py -h
usage: searchByImage.py [-h] [--output OUTPUT] [--exclude_stock EXCLUDE_STOCK] image_url n
Fetch image URLs from Google Image Search.
positional arguments:
image_url image search URL
n number of images (approx)
optional arguments:
-h, --help show this help message and exit
--output OUTPUT file to write URLs into
--exclude_stock EXCLUDE_STOCK
Boolean to indicate whether to exclude stock photos. Default: False
In order to output the downloaded image URLs into a text file, we can do so:
python searchByImage.py "https://upload.wikimedia.org/wikipedia/commons/thumb/9/9e/Giraffe_Mikumi_National_Park.jpg/800px-Giraffe_Mikumi_National_Park.jpg" 5 --output giraffe.txtSometimes Google image will show images from stock websites and they might have watermarks. In order to avoid downloading images from certain stock websites, you can include the --exclude_stock argument. Example:
python searchByImage.py "https://upload.wikimedia.org/wikipedia/commons/thumb/9/9e/Giraffe_Mikumi_National_Park.jpg/800px-Giraffe_Mikumi_National_Park.jpg" 5 --exclude_stock