/Data-Scraping

Information Scraping with Python

Primary LanguagePython

Web Scraping

This was just for Fun-type project to scrap data from various web site. Here we used Python and it’s some powerful packages like selenium, beautiful soup , scrap , request etc.

Selected Sites for data scraping were :

Amazon

  • Scrap best seller book list
  • Scrap most valuable comment
  • Describing info for a book

IMDB

  • Download all top rated movie poster
  • Scrap top rated movies

Linked In

  • Scrap information of target individual
  • Load n times pages at a time

Udemy

  • Scrap list of best selling courses

Installation

Downlaod Python 3x from the official website.

Local

pip install selenium
pip install BeautifulSoup
pip install requests

or just download anaconda distribution for get all the necessary pacakges simultaneously.

Download Chrome WebDriver or Download PhantomJS

git clone https://github.com/iphton/Data-Scraping
cd Web-Scraping
jupyter notebook

Go do the desired notebook, files that end with '.ipynb'. To run the model, go to the menu then click on Cell > Run all


Regards

World of Void.