/renocosts

Taking open data from the city of Reno to merge payroll, fire and police data.

Primary LanguageJava

Takes open data from Reno's city government and combines the data to form a new dataset with new insights. The data comes from http://dashboard.reno.gov/. The project combines Reno's open payroll data with police and fire call data. It calculates the cost per call based on the city's payroll data.

This project uses Java for cleansing the data sources and Hive (HQL) to run queries on the resulting data.

To try out this project, you will need Hadoop and Hive. Run the HQL files through Hive using the hive -f filename.hql syntax.

Steps:

Run table_create_1_paytables.hql

Run table_create_2_costpersecond.hql

Run table_create_3_fireandpolicecalls.hql

Run table_create_4_merged_fire_and_police.hql

Run the sample_queries.hql to see some usages of the data