/resource-families-link-checker

Scrapy-based link checker for the CLARIN resource families

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

resource-families-link-checker

Scrapy-based link checker for the CLARIN resource families. Inspired by:

Use: scrapy crawl resfam -o resfam-20200520.csv &> logs-resfam-20200520.txt

This will store the resulting CSV with the check results in resfam-20200520.csv and store verbose logs in logs-resfam-20200520.txt

See the output directory for some examples of output and log files.