/sitemap-gazer

Easy monitor website changes

Primary LanguagePython

sitemap-gazer

README 中文版

A tool that helps you easily monitor website changes.

It crawls sitemaps, saves them locally, compares with the previous crawl, and generates a README report for easy viewing. It supports monitoring multiple sitemaps in a single report.

It can also run in a GitHub Actions workflow: hackerqed/sitemap-gazer-example

Installation

pip install sitemap-gazer

mkdir sitemap-gazer-report
cd sitemap-gazer-report

sitemap-gazer init
# create sitemap-gazer.json

# edit sitemap-gazer.json
# for example, add a target site crazygames.com
# {
#   "sites": [
#     {
#       "name": "crazygames.com",
#       "url": "https://crazygames.com/"
#     }
#   ],
#   "genReadme": true,
#   "output_dir": "data"
# }

sitemap-gazer
# crawl data, save to ./data/
# README.md will be generated automatically

Development

To run as developer:

poetry install
poetry run sitemap-gazer

To build and install locally:

poetry build
pip install dist/sitemap_gazer-0.1.0-py3-none-any.whl