CD Cover Image Crawl

This project provides two ways to download CD cover images:

  • Discogs website
  • Google Image Search Engine

Prerequisites:

  • lxml
  • requests
  • urllib (part of the Python standard library)

Get information from Discogs

Discogs provides a raw dataset and a database API. We can also analyze the URL directly to fetch the data.

Fetch the information with the requests library.
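As a rough illustration of this step, the sketch below looks up a single release and derives a search name from it. It is not the project's own code: it assumes the public API endpoint https://api.discogs.com/releases/<id> and the "title" and "artists" fields of its JSON response, whereas the project may parse the website pages with lxml instead. The helper names (release_url, fetch_release, search_name) and the User-Agent string are hypothetical.

```python
"""Sketch: look up one Discogs release and derive a search name from it."""

API_ROOT = "https://api.discogs.com"  # assumption: public Discogs API base URL


def release_url(release_id):
    """Build the API URL for a single release ID."""
    return f"{API_ROOT}/releases/{int(release_id)}"


def fetch_release(release_id, timeout=10):
    """Download the release record as a dict (needs network access)."""
    import requests  # imported lazily so the pure helpers also work offline

    response = requests.get(
        release_url(release_id),
        # Hypothetical client name; Discogs expects an identifying User-Agent.
        headers={"User-Agent": "cd-cover-crawl-example/0.1"},
        timeout=timeout,
    )
    response.raise_for_status()  # fail loudly on 4xx/5xx responses
    return response.json()


def search_name(release):
    """Combine artist names and title into one search string."""
    # Assumed response shape: {"title": str, "artists": [{"name": str}, ...]}
    artists = ", ".join(a["name"] for a in release.get("artists", []))
    title = release.get("title", "")
    return f"{artists} {title}".strip()
```

A call such as search_name(fetch_release(some_id)) would then yield one line for the search name list, assuming the response has the shape noted in the comments.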

Run the code to download the images and generate the search name list:

cd code
python main.py

Download images from Google Images

The code reads the search name list (e.g., CD_list.txt) and downloads the top "n" images for each name. The current search name list is generated by Discogs/code/main.py.

cd Google
python main.py
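
The flow described in this section can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code in Google/main.py: it assumes the list file holds one search name per line, that images are saved as .jpg, and that the image URLs for a name have already been extracted from the result page (that extraction is omitted here). All function names are hypothetical.

```python
import os
import urllib.parse
import urllib.request


def read_search_names(path):
    """Return the non-empty lines of the search name list, stripped."""
    with open(path, encoding="utf-8") as handle:
        return [line.strip() for line in handle if line.strip()]


def image_search_url(name):
    """Build a Google Images query URL (tbm=isch selects image results)."""
    query = urllib.parse.urlencode({"q": name, "tbm": "isch"})
    return f"https://www.google.com/search?{query}"


def target_paths(name, n, out_dir="images"):
    """File paths for the top n images of one search name."""
    # Replace anything that is not a letter or digit to get a safe file name.
    safe = "".join(c if c.isalnum() else "_" for c in name)
    # Assumption: every image is stored with a .jpg extension.
    return [os.path.join(out_dir, f"{safe}_{i}.jpg") for i in range(1, n + 1)]


def download_top_n(image_urls, name, n, out_dir="images"):
    """Save at most n of the given image URLs (needs network access)."""
    os.makedirs(out_dir, exist_ok=True)
    saved = []
    # zip stops at the shorter sequence, so at most n files are written.
    for url, path in zip(image_urls, target_paths(name, n, out_dir)):
        urllib.request.urlretrieve(url, path)
        saved.append(path)
    return saved
```

Keeping the list reading, URL building, and file naming separate from the download itself makes the first three easy to check without network access.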