TJET data pipeline

This repository contains the code for processing TJET raw data and assembling datasets, including:

  1. downloading all data from the Airtable development bases (TJET MegaBase and TJET Prosecutions)

    • pipeline/downloads.R
  2. processing these data for the website and for analyses

    • pipeline/processing.R
  3. accessing and transforming UCDP conflict data

    • conflicts/UCDP_lookups.R
  4. accessing regime datasets and coding TJET transitions variables

    • transitions/transitions.R
  5. assembling analyses datasets

    • pipeline/analysis_prep.R
  6. translating parts of the TJET database for the website

    • pipeline/translation.R
  7. writing data to the production database for the TJET website

    • pipeline/sql.R

Running the pipeline.R script will carry out all necessary tasks for moving updated TJET data to the website production database. USE WITH CAUTION! There is a risk of damage to the (staging) website and of incurring costs.

Important note

This repository is made public for the purpose of transparency and replicability. Only those with access to required credentials (read from the local environment) will be able to run all the code.

Folders in this repository

├── conflicts
│   ├── original_data
├── data
│   ├── downloads
├── functions
├── pipeline
├── tjet-datasets
├── transitions 
    ├── original_data 

Processed TJET data which can be used for analyses are contained in tjet-datasets.