/corpus

Official QuantGov Corpora

Primary LanguagePython

The QuantGov Corpus

Official QuantGov Corpora

This repository is for those who would like to create new datasets using the QuantGov platform. If you would like to find data that has been produced using the QuantGov platform, please visit https://www.quantgov.org/data.

This repository contains all official QuantGov corpora, with each corpus stored in its own branch.

The Generic Corpus

The master branch of this repository is the Generic Corpus, which serves all files in the data/clean folder, with the file path as the index. See the Snakefile for more details.

Using this Corpus

To use or modify this corpus, clone it using git or download the archive from the QuantGov Site and unzip it on your computer.

Requirements

Using this corpus requires the QuantGov library.