/data

UCLA Law COVID-19 Behind Bars Data

GNU General Public License v3.0GPL-3.0

logo

UCLA Law COVID-19 Behind Bars Data

Background

The UCLA Law COVID-19 Behind Bars Data Project, launched in March 2020, tracks the spread and impact of COVID-19 in American carceral facilities and advocates for greater transparency and accountability around the pandemic response of the carceral system. Since March, we have been collecting and reporting facility-level data on COVID-19 in prisons, jails, and other correctional centers.

  • Latest data: Our latest data on COVID-19 in carceral facilities is maintained in this repository.
  • Historical data: Our historical data is maintained in our historical-data repository. We are in the process of cleaning this data and will be adding additional states as this data becomes available.
  • Additional data: We also collect information about pandemic-related prison and jail releases, legal filings and court orders bearing on the safety of incarcerated people, and grassroots organizing campaigns and fundraisers here.

Our Process

Our core dataset includes information on COVID-19 cases, deaths, and tests across more than 1,500 state, federal, county, and immigration correctional facilities. We maintain this dataset by scraping and standardizing data from more than 80 sources. We scrape this data 3-4 times each week, although correctional agencies vary in how often they update their data. Our scraper production code and more detailed documentation are available on GitHub.

The majority of the facilities that we collect data on fall under state jurisdiction, where COVID-19 data is reported on state Department of Correction (DOC) websites. We also collect data from federal prisons reported by the Federal Bureau of Prisons (BOP), immigration detention centers reported by Immigrations and Customs Enforcement (ICE) and from several large county jail systems – including Los Angeles, New York City, Philadelphia, Maricopa County, Orange County, Cook County, and Hennepin County.

We are continuously adding to and refining our scrapers. Where possible, we have also retrospectively added COVID-19 data for facilities using digital archives.

Contributors: Our data for several jails in California is collected by Davis Vanguard, who have been generously sharing their COVID-19 data with us. Our data for state prisons in Massachusetts is reported by the ACLU of Massachusetts. If you would like to contribute data on COVID-19 in a facility that we don't currently include, please see our template. We always welcome additional contributors!

Our Data

Our core dataset includes the following metrics reported separately for incarcerated people and staff at the facility level:

  • Cumulative COVID-19 cases
  • Cumulative COVID-19 deaths
  • Active COVID-19 cases
  • COVID-19 tests administered

We also collect additional information based on what agencies report (e.g. population data and vaccination data). While we aim to collect facility-level data, not all jurisdictions report COVID-19 metrics at the facility level. Some DOCs only report statewide totals, and others do not report any data for certain metrics. Authorities also vary dramatically in how they define the metrics that they report. We do our best to standardize these variables, but comparing data across jurisdictions and over time should be done with caution.

Note: Jurisdictions are continuously updating how, where, and whether they update their data. We do our best to accurately collect as much data as possible, but our data availability is subject to change.

Data Dictionary

The full set of variables that we report includes the following:

Variable Description
Facility.ID Integer ID that uniquely identifies every facility
Jurisdiction Whether the facility falls under state, county, federal, or immigration jurisdiction
State State where the facility is located
Name Facility name
Date Date data was scraped (not necessarily date updated by the reporting source)
Source Source from which the data was scraped
Residents.Confirmed Cumulative number of incarcerated individuals infected with COVID-19
Staff.Confirmed Cumulative number of staff infected with COVID-19
Residents.Deaths Cumulative number of incarcerated individuals who died from COVID-19
Staff.Deaths Cumulative number of staff who died from COVID-19
Residents.Recovered Cumulative number of incarcerated individuals who recovered from COVID-19
Staff.Recovered Cumulative number of staff who recovered from COVID-19
Residents.Tadmin Cumulative number of COVID-19 tests administered to incarcerated individuals
Staff.Tested Cumulative number of staff tested for COVID-19
Residents.Negative Cumulative number of incarcerated individuals who tested negative for COVID-19
Staff.Negative Cumulative number of staff who tested negative for COVID-19
Residents.Pending Number of incarcerated individuals currently with pending test results for COVID-19
Staff.Pending Number of staff currently with pending test results for COVID-19
Residents.Quarantine Number of incarcerated individuals currently in quarantine from COVID-19
Staff.Quarantine Number of staff currently in quarantine from COVID-19
Residents.Active Number of incarcerated individuals currently infected with COVID-19
Population.Feb20 Population of the facility as close to February 1, 2020 as possible
Residents.Population Current population of incarcerated individuals reported by agency website
Residents.Tested Cumulative number of incarcerated individuals tested for COVID-19
Residents.Initiated Cumulative number of incarcerated individuals who have initiated COVID-19 vaccination (i.e. received any dosage of a vaccine)
Residents.Completed Cumulative number of incarcerated individuals who have fully completed their COVID-19 vaccination schedule
Residents.Vadmin Cumulative number of COVID-19 vaccines administered to incarcerated individuals
Staff.Initiated Cumulative number of staff who have initiated COVID-19 vaccination (i.e. received any dosage of a vaccine)
Staff.Completed Cumulative number of staff who have fully completed their COVID-19 vaccination schedule
Staff.Vadmin Cumulative number of COVID-19 vaccines administered to staff
Address The facility's address
Zipcode The facility's zipcode
City The facility's city
County The facility's county
Latitude The facility's latitude
Longitude The facility's longitude
County.FIPS The facility's 5-digit county FIPS code
HIFLD.ID The facility's corresponding Homeland Infrastructure Foundation-Level Data ID

Accessing Our Data

This repository contains the latest values that we scraped for a given facility. We are currently in the process of cleaning our full historical time series data and integrating population data to more readily compute COVID-19 rates across facilities over the course of the pandemic. This data is available for several states here. All of our time series data since November is available here.

We are developing an R package behindbarstools, which includes a variety of functions to help pull, clean, wrangle, and visualize our data. We recommend using this package to access our latest data.

To access post-November time series data in R:

devtools::install_github("uclalawcovid19behindbars/behindbarstools")

data <- behindbarstools::read_scrape_data(all_dates = TRUE, coalesce = TRUE)

To access post-November time series data in Python:

import pandas as pd 

data = pd.read_csv("http://104.131.72.50:3838/scraper_data/summary_data/scraped_time_series.csv")

Citations

Citations for academic publications and research reports:

Sharon Dolovich, Aaron Littman, Kalind Parish, Grace DiLaura, Chase Hommeyer, Michael Everett, Hope Johnson, Neal Marquez, and Erika Tyagi. UCLA Law Covid-19 Behind Bars Data Project: Jail/Prison Confirmed Cases Dataset [date you downloaded the data]. UCLA Law, 2020, https://uclacovidbehindbars.org/.

Citations for media outlets, policy briefs, and online resources:

UCLA Law Covid-19 Behind Bars Data Project, https://uclacovidbehindbars.org/.

License

Our data is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. That means that you must give appropriate credit, provide a link to the license, and indicate if changes were made. You may not use our work for commercial purposes, which means anything primarily intended for or directed toward commercial advantage or monetary compensation.