GoogleCloudPlatform/covid-19-open-data

Calculate US State Aggregations of Static Covariates

gserapio opened this issue · 2 comments

This is a stretch of an enhancement request, but I thought I'd ask!

The United States is rich with COVID-19 data that is segmented by state, but many basic static covariates provided in the Demographics, Economy, and Health tables are unavailable at the US state-level. Having these variables on hand could significantly enhance future research looking more closely at COVID-19 in the US. The following variables by US state (aggregation_level == 1) seem derivable from data already included in this repo or available from an external source:

Demographics Table

  • population_rural
  • population_urban
  • population_density
  • human_development_index (state-level data available on Wikipedia [https://en.wikipedia.org/wiki/List_of_U.S._states_and_territories_by_Human_Development_Index] sourced from [https://globaldatalab.org/shdi/shdi/USA/?levels=1%2B4&interpolation=1&extrapolation=0&nearest_real=0])

Economy Table

  • gdp
  • gdp_per_capita

Health Table

  • (all variables)

One might also be able to further estimate a subset of the population and health variables for US counties, though that would be a greater undertaking.

Thanks for the suggestion. This is certainly feasible, and we already try to source as many of these variables as we can for subnational locations — mainly using Wikidata.org as the source.

We can certainly add the globaldatalab.org data source for the state-level HDI variable. For the other variables, do you have a specific data source in mind?

Great!

Demographics Table

I'm not sure how population_rural vs population_urban has been defined across different countries on your team's end, but I think this data at the state level should be available from the US Census. I've had difficulty finding a table for 2019 on their website, however. It probably can be extracted from raw American Community Survey (ACS) data.

Economy Table

I was able to find GDP at the state level on Wikidata.org, citing this Wikipedia page, citing the Bureau of Economic Analysis.

Health Table

Adult cigarette use by US state: https://www.cdc.gov/statesystem/cigaretteuseadult.html
Countyhealthrankings.org from the University of Wisconsin seems to have most of the Health Table variables available by US state for 2019.