A tool that helps you easily monitor website changes.
It crawls sitemaps, saves them locally, compares with the previous crawl, and generates a README report for easy viewing. It supports monitoring multiple sitemaps in a single report.
It can also run in a GitHub Actions workflow: hackerqed/sitemap-gazer-example
pip install sitemap-gazer
mkdir sitemap-gazer-report
cd sitemap-gazer-report
sitemap-gazer init
# create sitemap-gazer.json
# edit sitemap-gazer.json
# for example, add a target site crazygames.com
# {
# "sites": [
# {
# "name": "crazygames.com",
# "url": "https://crazygames.com/"
# }
# ],
# "genReadme": true,
# "output_dir": "data"
# }
sitemap-gazer
# crawl data, save to ./data/
# README.md will be generated automatically
To run as developer:
poetry install
poetry run sitemap-gazer
To build and install locally:
poetry build
pip install dist/sitemap_gazer-0.1.0-py3-none-any.whl