/www-sitemap-generator

Primary LanguagePythonMozilla Public License 2.0MPL-2.0

www-sitemap-generator

A tool for ingesting the sitemap.json file from www.mozilla.org and creating sitemap protocol compatible XML files with lastmod datetimes based on changes to etags.

Usage

$ ./generate_sitemap_docker.sh

This will build the docker image, run update_etags.py which gets https://www.mozilla.org/sitemap.json, saves it, then uses that data to do GET requests against all of the URLs to get and compare the etags in the responses, and then run generate_sitemaps.py which takes the data saved by update_etags.py and generates the XML sitemaps for use on the site.

Development

Install pre-commit, and then run pre-commit install and you'll be setup to auto format your code according to our style and check for errors for every commit.