This repository contains a basic scraper for Thuisarts.
The script produces three outputs:
- thuisarts.yaml contains topics along with an ID and a link to the summary page
- thuisarts-synonyms.yaml contains a mapping of topics from thuisarts.yaml to their synonyms (to create a larger index)
- thuisarts-summaries is a folder with .txt files, containing the summary text for the topic matching that ID
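
As a rough illustration of how these outputs could fit together, the sketch below builds a lookup index from topic names and synonyms to IDs, then derives the path of the matching summary file. Everything about the data layout here is an assumption rather than something taken from the scraper: the key names `id` and `name`, the shape of the synonym mapping (topic name to a list of synonyms), the `<ID>.txt` file naming, and the helper names `build_index` and `summary_path` are all hypothetical. The inputs are plain Python structures, as you might get after parsing the YAML files with a YAML library of your choice.

```python
from pathlib import Path


def build_index(topics, synonyms):
    """Map every lower-cased topic name and synonym to its topic ID.

    Assumed shapes (not confirmed by the scraper):
      topics   -- list of dicts with at least "id" and "name"
      synonyms -- dict mapping a topic name to a list of synonyms
    """
    index = {}
    for topic in topics:
        topic_id = topic["id"]
        index[topic["name"].lower()] = topic_id
        for synonym in synonyms.get(topic["name"], []):
            # Keep the first ID seen if a synonym appears under several topics.
            index.setdefault(synonym.lower(), topic_id)
    return index


def summary_path(topic_id, summaries_dir="thuisarts-summaries"):
    """Return the assumed location of a summary: <summaries_dir>/<ID>.txt."""
    return Path(summaries_dir) / f"{topic_id}.txt"


# Made-up sample data, only to show the intended usage.
sample_topics = [{"id": 7, "name": "Griep"}]
sample_synonyms = {"Griep": ["Influenza"]}

index = build_index(sample_topics, sample_synonyms)
path = summary_path(index["influenza"])
```

If the real files use different keys or a different file naming scheme, only the two helper functions would need to change.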