
Crawling webpages for structured data.


pycrawl: Web crawler for generating structured data.

Usage:

Create and activate a Python virtual environment:

python3 -m venv myenv
source myenv/bin/activate

Install the dependencies, then run the crawler to fetch JSON-LD data. Before running, edit crawl.py to set the URL to crawl.

pip3 install -r requirements.txt
python3 crawl.py
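For readers unfamiliar with JSON-LD: it is structured data that pages embed in script elements of type application/ld+json. The contents of crawl.py are not shown here, so the following is only a minimal sketch of that extraction step using the Python standard library, not the repository's actual implementation. The names JSONLDParser, extract_jsonld, and SAMPLE_HTML are hypothetical and chosen for illustration.

```python
import json
from html.parser import HTMLParser


class JSONLDParser(HTMLParser):
    """Collects the raw text of every <script type="application/ld+json"> element."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._chunks = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        # The type attribute may be missing or valueless, so fall back to "".
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._chunks = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            self.blocks.append("".join(self._chunks))


def extract_jsonld(html):
    """Return the parsed JSON-LD objects found in an HTML string."""
    parser = JSONLDParser()
    parser.feed(html)
    parser.close()
    results = []
    for raw in parser.blocks:
        try:
            results.append(json.loads(raw))
        except json.JSONDecodeError:
            # Skip blocks that are not valid JSON instead of aborting.
            continue
    return results


# Hypothetical sample page, used here in place of a live URL.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Article", "headline": "Hello"}
</script>
</head><body></body></html>
"""

items = extract_jsonld(SAMPLE_HTML)
print(items[0]["@type"])
```

For a live page, the HTML would need to be fetched first (for example with urllib.request.urlopen from the standard library) and the decoded response body passed to extract_jsonld.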

Run deactivate to exit the Python virtual environment.

Credits / references