This repository scrapes 'meta' information about journal articles published in the AER from 1999 to present.
We use python
and selenium
to control a headless instance of Google chrome
To run the code:
- install chromedriver following the receipe here
- Install Snakemake from command line:
pip install snakemake
- Ensure all python packages are installed from command line:
pip install -r requirements.txt
- Run
snakemake
from the command line- Sometimes one or more of the aer issues won't run through completely, a workaround curently used is
snakemake --keep-going
to continue running through all jobs. Then re-runsnakemake --keep-going
until all targets are constructed
- Sometimes one or more of the aer issues won't run through completely, a workaround curently used is
Tested on: Ubuntu: 16.04 Last Run: Feb 2, 2018