Scrape-website

In this repository, I collect some of my experiments in gathering data in bulk by scraping. This code is intended for research and education only.

Primary language: Python

Get a book dataset from www.bukabuku.com:

Code: scrape-bukabuku.py
Result: result_bukabuku.xlsx
Steps:

  1. Choose a book category (Anak-Anak, Arsitektur & Design Interior, etc.). I chose "Orang Tua & Keluarga, Psikologi".

  2. The URL then changes to include the category number. "59" is the unique number of this book category on that website.

  3. For each book, I want the title, author, original price, discounted price, description, and link.
    The book description is on the book's own page (just click the title, and boom!).
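
The steps above can be sketched roughly as follows. This is a minimal illustration, not the contents of scrape-bukabuku.py: the HTML samples, the class names (`book`, `title`, `author`, `price`, `discount-price`, `description`), the `https://` base URL, and the "Rp 100.000" price format are all assumptions made for the example, since the real page markup is not shown here. It uses only the Python standard library and parses inline HTML strings instead of fetching live pages.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

BASE_URL = "https://www.bukabuku.com"  # assumption: site is served over https

# Hypothetical markup: the real listing page almost certainly differs.
SAMPLE_LISTING = """
<div class="book">
  <a class="title" href="/book/1">Buku A</a>
  <span class="author">Penulis A</span>
  <span class="price">Rp 100.000</span>
  <span class="discount-price">Rp 85.000</span>
</div>
<div class="book">
  <a class="title" href="/book/2">Buku B</a>
  <span class="author">Penulis B</span>
  <span class="price">Rp 50.000</span>
</div>
"""
SAMPLE_DETAIL = '<div class="description"><p>Ringkasan buku A.</p></div>'

VOID_TAGS = {"br", "img", "hr", "input", "meta", "link"}  # tags with no end tag


class ListingParser(HTMLParser):
    """Collect one dict per book from a category listing page."""

    # Maps the (assumed) CSS class to the output field name.
    FIELDS = {"title": "title", "author": "author",
              "price": "real_price", "discount-price": "discount_price"}

    def __init__(self):
        super().__init__()
        self.books = []
        self._field = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "div" and "book" in classes:
            self.books.append({})  # a new book card starts here
            return
        for name in classes:
            if name in self.FIELDS and self.books:
                self._field = self.FIELDS[name]
                if name == "title":  # the title anchor also carries the link
                    self.books[-1]["link"] = urljoin(BASE_URL, attrs.get("href") or "")

    def handle_data(self, data):
        if self._field and data.strip():
            self.books[-1][self._field] = data.strip()

    def handle_endtag(self, tag):
        self._field = None


class DescriptionParser(HTMLParser):
    """Collect all text nested inside the element with class 'description'."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        classes = (dict(attrs).get("class") or "").split()
        if self._depth or "description" in classes:
            self._depth += 1

    def handle_endtag(self, tag):
        if self._depth and tag not in VOID_TAGS:
            self._depth -= 1

    def handle_data(self, data):
        if self._depth and data.strip():
            self.parts.append(data.strip())


def parse_rupiah(text):
    """Turn a string like 'Rp 100.000' into the integer 100000 (None if empty)."""
    digits = "".join(ch for ch in text if ch.isdigit())
    return int(digits) if digits else None


def parse_listing(html):
    parser = ListingParser()
    parser.feed(html)
    for book in parser.books:
        for key in ("real_price", "discount_price"):
            book[key] = parse_rupiah(book.get(key, ""))
    return parser.books


def parse_description(html):
    parser = DescriptionParser()
    parser.feed(html)
    return " ".join(parser.parts)


if __name__ == "__main__":
    for book in parse_listing(SAMPLE_LISTING):
        print(book)
    print(parse_description(SAMPLE_DETAIL))
```

In a real run, each listing page and each book link would be downloaded first (for example with `urllib.request.urlopen`) and the resulting HTML passed to `parse_listing` and `parse_description`. A short pause between requests keeps the load on the site low.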