/Veil

:bride_with_veil: Python utility for de-identification of csv files in accordance with 45 CFR § 164.514

Primary LanguagePython

Veil: Deidentify data

Veil is a collection of utilities to help deidentify datasets. Currently, the main purpose of Veil is to provide an easy way to create a reference table for an ID column that is sensitive and to map it to any number of downstream dataframes. Veil relies on Pandas to do most of the heavy lifting, and simply provides a way to concisely deidentify and reidentify columns where necessary.

TODO:

  • Add module to work with date columns
  • Provide more saving formats for id_veil.save()
  • Add tests

Protected Health Information (PHI)

The de-identification criteria for protected health information in the United States is described here

License

MIT