/opennames

An open database of persons of interest and politically exposed persons

Primary LanguagePythonMIT LicenseMIT

opensanctions data pipeline

The codebase for OpenSanctions, an open-source database of sanctions data, politically exposed persons, and other entities of interest. This repository contains the code used to parse, clean, and deduplicate source data and build the combined database.

OpenSanctions uses Follow the Money, a JSON-based anti-corruption data model, as a common target for all crawlers. Additonal exports into CSV and JSON formats are planned.

Technical overview

Repository layout:

  • opensanctions/: Python project with data extraction and cleaning components
  • docs/: Sphinx technical documentation

Related repositories:

Daily data extraction and processing runs on GitHub Actions. Status:

production

Licensing: code is MIT-licensed, content & data is CC 4.0 Attribution-NonCommercial.