This repository contains all the information used during Wharton Data Science 2023 summer program. Included is all the raw data, their respective sources and any other files used during the presentation.
Kaggle Dataset
All files used in our final presentation can be found in the "Final Model Directory".
If you have questions, please contact at chenjacob@outlook.com
Sources:
Total County Population: https://www.census.gov/data/tables/time-series/demo/popest/2020s-counties-total.html
County Police Force Numbers: https://cde.ucr.cjis.gov/LATEST/webapp/#/pages/downloads
County Race Breakdown: https://www.kaggle.com/datasets/mikejohnsonjr/us-counties-diversity-index
Party Affiliation: https://www.pewresearch.org/religion/religious-landscape-study/compare/party-affiliation/by/state/
Internet Access: https://www.fcc.gov/form-477-county-data-internet-access-services
Annual Income by County: https://apps.bea.gov/regional/downloadzip.cfm
Violent Crime by County: https://ucr.fbi.gov/crime-in-the-u.s/2016/crime-in-the-u.s.-2016/tables/table-8/table-8.xls/view
Age Range Data: https://www.census.gov/data/tables/time-series/demo/popest/2020s-counties-detail.html