This repository was created while working on a DataKind datadive for ADL.
Currently, this is mostly data cleaning and munging. There are some interesting datasets in the output folder.
incidents_supplemented_2.csv
combines the state level aggregated hate crime data with census population estimates, enhanced penalty law information, other statute information, and creates per capita measurements of hate crime types- the
race_corrected
column is therace_ethnicity_ancestry
column for 2015, and for prior years it is therace
andethnicity
columns added together. The definitions changed in 2015. Hopefully this is a fair comparison
- the
table_12_2004-2015.csv
is the yearly agency reporting data (aka "table 12") from the FBI UCR Hate Crime Statistics. https://ucr.fbi.gov/hate-crimeincident_aggregated_with_agencies.csv
is the supplemented incident data above merged with the table 12 agency data for the years 2004-2015.- There is a calculated field labled
percent_population_covered
which is thepopulation covered
field divided by theest_pop
(estimated population) field
- There is a calculated field labled
For the adventurous, there's some attempted modeling in the modeling notebook as well.