/county_projections_official

Complete replication code for county-level population projections

Primary LanguageRMIT LicenseMIT

Introduction

Here I provide the code for: Population projections for all U.S. counties by age, sex, and race controlled to the Shared Socioeconomic Pathways

Citation

Hauer, M.E. Open Science Framework. (DOI 10.17605/OSF.IO/9YNFC) (YEAR ACCESSED).

Abstract

Small area and subnational population projections are important for understanding long-term demographic changes. I provide county-level population projections by age, sex, and race in five-year intervals for the period 2015-2100 for all U.S. counties. Using historic U.S. census data in temporally rectified county boundaries and race groups for the period 1990-2015, I calculate cohort-change ratios (CCRs) and cohort-change differences (CCDs) for eighteen five-year age groups (0-85+), two sex groups (Male and Female), and four race groups (White NH, Black NH, Other NH, Hispanic) for all U.S counties. I then project these CCRs/CCDs using ARIMA models as inputs into Leslie matrix population projection models and control the projections to the Shared Socioeconomic Pathways. I validate the methods using ex-post facto evaluations using data from 1969-2000 to project 2000-2015. My results are reasonably accurate for this period. These data have numerous potential uses and can serve as inputs for addressing questions involving sub-national demographic change in the United States.

Main Figure Projected numeric population changes for the five SSPs between 2020 and 2100 for counties in the continental United States.

Organization

  • SCRIPTS — Scripts and output for figures included in the documentation.
  • DATA — Initial data resources, unprocessed.
  • DATA-PROCESSED — Processed data used to create the projections.
  • FIGURES — output figures.
  • PROJECTIONS — contains two subfolders. PROJECTIONS/EVAL/ contains the out-of-sample validation for the period 2000-2015. PROJECTIONS/PROJECTIONS/ contains the actual projections.

Use

  • Feel free to create a new branch for further incorporation and analysis .
  • More information in is located in each folder DATA.

Data

The projected populations by age/sex/race/county/year/SSP for all US counties for the period 2020-2100 are available at the Open Science Foundation https://dx.doi.org/10.17605/OSF.IO/9YNFC.

Data resulting from these projections can be found in the file SSP_asrc.csv.zip.

Codebook

YEAR

  • Refers to the year of each projection. 2020 -- 2100 in five-year increments.

STATE

  • Refers to the two-digit FIPS code associated with each state.

COUNTY

  • Refers to the three-digit FIPS code associated with each county in each state.

GEOID

  • Refers to the five-digit FIPS code associated with each individual county and is the product of the STATE two-digit code and the COUNTY three-digit code.

AGE

  • 0-4 = 1
  • 5-9 = 2
  • ...
  • 80-84 = 17
  • 85+ = 18

SEX

  • Male = 1
  • Female = 2

RACE

  • White NH = 1
  • Black, NH = 2
  • Hispanic = 3
  • Other, NH =4

Correspondence

For any issues with the functionality of these scripts please create an issue.

License

The data collected and presented is licensed under the Creative Commons Attribution 3.0 license, and the underlying code used to format, analyze and display that content is licensed under the MIT license.