This repo accompanies an article on freeCodeCamp that explains how to crawl a set of static sites as well as a Single Page Application.
Make sure you update the dependencies locally, because the versions here are intentionally pinned. Be aware that the pinned versions contain known vulnerabilities!
It is recommended to clone this repo into a virtual environment. The code was written in Python 3.6 on Sabayon Linux, but other platforms should work as well.
virtualenv fcc-web-scraping-example --python=python3
cd fcc-web-scraping-example
. bin/activate
git clone src
cd src
pip install -r requirements.txt
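Because the versions in requirements.txt are pinned, it can help to list the pins before updating anything locally. The following is only a minimal sketch: it assumes the file uses plain `name==version` lines (its contents are not shown here), and the helper name `pinned_requirements` as well as the sample package are hypothetical.

```python
import re
from typing import List, Tuple

# Matches lines such as "name==1.2.3". Extras, environment markers and
# other specifiers (>=, ~=, ...) are deliberately ignored in this sketch.
PIN_PATTERN = re.compile(r"^\s*([A-Za-z0-9][A-Za-z0-9._-]*)\s*==\s*([^\s;#]+)")


def pinned_requirements(text: str) -> List[Tuple[str, str]]:
    """Return (package, version) pairs for every `name==version` line."""
    pins = []
    for line in text.splitlines():
        match = PIN_PATTERN.match(line)
        if match:
            pins.append((match.group(1), match.group(2)))
    return pins


# Hypothetical input; in practice, pass in the text of requirements.txt.
print(pinned_requirements("example-package==1.0.0\n"))
# prints [('example-package', '1.0.0')]
```

Each package reported this way can then be upgraded individually, for example with pip's `--upgrade` option, while the pinned file in the repo stays unchanged.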
For testing, you'll need to install the package locally as well.
pip install -e .
python -m pytest tests/
GPL v3 or later. See LICENSE.