Source code for the paper:
Estimating the Size of High-risk Populations for COVID-19 Mortality across 442 US Cities
This folder contains the raw data downloaded from various publicly accessible sites to obtain prevalence data on various risk factors.
BRFSS
This folder contains the raw data downloaded from CDC. The BRFSS “500 Cities: Local Data for Better Health, 2019 release provides information on estimates of crude prevalence for covariates related to unhealthy behaviors and health outcomes among adult US residents for 500 cities in the year 2017. The risk factors are BMI, smoking status, hypertension, diabetes, respiratory disease other than asthma, chronic heart disease, stroke, kidney disease, arthritis, asthma.
Census
This folder contains raw files on demographic variables, age, sex and ethnicity at city level downloaded from the publicly available American Community Survey (ACS) data.
Cancer
The USCS data is a combined cancer data source from Centers for Disease Control and Prevention (CDC) and the National Cancer Institute (NCI) that provides statistics on the cancer incidence (newly diagonsed cases) across different cancer sites at county level. This folder contains information on cancer incidence rates. We adjust it with survival rates to obtain cancer prevalence for Haematological and Non-Haematological cancers.
SDI
Information of Social Deprivation Index is obtained from Robert Graham Center and American Community Survey. This folder contains information on county level SDI quintile, which is a prooxy for IMD used in UK analysis.
Geocodes
This folder contains data for mapping counties and cities.
NHANES_Asthma_Diabetes
This folder is required to categorize diabetes prevalences into controlled and uncontrolled group.
Mapping_cities_county and merge_city_data`: code for merging city- and county-level data.
This folder contains all the intermediate datasets created.
This folder contains all codes for the analysis in the paper, including calculating the mean risk in each city, risk for the NHIS individuals, the proportion and size of populations/cases that exceed various high-risk thresholds, etc.
code
This folder contains all codes to create the final set of figures and tables in the paper.
output
This folder contains the final set of figures and tables.
- Jin Jin, Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health
- Neha Agarwala, Department of Mathematics and Statistics, University of Maryland, Baltimore County
- Prosenjit Kundu, Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health
- Nilanjan Chatterjee, Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health