Collector for Peace and Security Pillar Datasets

Build Status Coverage Status

This script connects to the Peace and Security API and extracts data and metadata from all endpoints creating 62 datasets in HDX. It makes 63 reads from the data hub and 62 read/writes (API calls) to HDX in a one hour period. It creates 62 temporary files each a few Kb which it uploads into HDX. It is run every day.

Usage

python run.py

For the script to run, you will need to have a file called .hdx_configuration.yaml in your home directory containing your HDX key eg.

hdx_key: "XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX"
hdx_read_only: false
hdx_site: prod

You will also need to supply the universal .useragents.yaml file in your home directory as specified in the parameter user_agent_config_yaml passed to facade in run.py. The collector reads the key hdx-scraper-peacesecurity as specified in the parameter user_agent_lookup.

Alternatively, you can set up environment variables: USER_AGENT, HDX_KEY, HDX_SITE, TEMP_DIR, LOG_FILE_ONLY