/glassdoor-salary-scraper

Web crawler/scraper for Glassdoor salary data

Primary LanguagePython

glassdoor-salary-scraper

Current Version: 2.0

Glassdoor web crawler and scraper providing salary data. Forked and modified from williamxie11.

Installation

  • Python 2.7.*

  • Beautiful Soup 4 (4.4.1)

$ pip install bs4
  • Selenium Webdriver
$ pip install selenium

Usage

  1. Open the scraper Python script with a text editor of your choice.

  2. Add your Glassdoor account username and password.

  1. Specify the cities you would like to scrape in cities.txt with each city on a new line

  2. Run the scraper

$ python scraper.py

NOTE: Glassdoor will require you to insert CAPTCHA on login or during the scraping process. The script will poll until CAPTCHA is entered during scraping.

Results

response

The web scraper will output a JSON with the name "[city name].json" in the Data directory, it will also output a JSON with the name "allcities.json" that will include all the cities data. Each data point in the JSON corresponds to one salary on Glassdoor with attributes (company name, job title).