/job_scraper

Scraping jobs into easy to filter dataframes

Primary LanguageJupyter Notebook

job_scraper

Simple python tools for scraping jobs into a dataframe, removing duplicates. Given the nature of web scraping, strategies have to be specific to sites so currently, only tools for jobs.ac.uk (primarily academic posts) have been created. The plan is to expand this further.

Installing dependencies - command line with Anaconda / miniconda

  1. clone this repository to a desired location $ git clone https://github.com/VolodymyrChapman/job_scraper.git
  2. navigate into the cloned repository $ cd job_scraper
  3. install dependencies from environment file $ conda env create -f environment.yml

Usage:

For usage examples, please refer to the example_usage.ipynb notebook.

Finally, PRs etc.

Collaboration, especially to expand functionality to other sites would be greatly appreciated.

Please feel free to clone, expand, submit pull requests etc.