Westac Project, 2020-2021
The full data set consists of multiple parts:
- Riksdagens protokoll between from 1921 until today in the Parla-clarin format
- Comprehensive list of MPs and cabinet members during this period
- Traceable logs of all curation and segmentation as a git history
A full dataset is available under the zip download on this page. The unzipped folder is structured in the following manner
- Annual protocol files in the
corpus/
folder - List of MPs
corpus/members_of_parliament.csv
The corpora are large and automatically curated and segmented. If you find any errors, it is possible to submit corrections to them. This is documented in the project wiki.