ADS Group Project
Data Source: https://data.cityofnewyork.us/Public-Safety/NYPD-Motor-Vehicle-Collisions/h9gi-nx95
Cleaned and filtered by date: https://drive.google.com/open?id=1sWmIGU5ygXvGBtBuEr1rLNBEOmrk4J41
- Data Wrangling:
- Using Random Forest and Linear Regression, we successfully explained the periodicity of the accidents with hour and day-of-week features. Out-of-sample R2: 0.83
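
To illustrate the idea of explaining hourly collision counts from hour and day-of-week features, here is a minimal sketch. It is not the notebook's code. The data is synthetic, and every name in it (`make_synthetic_hourly_counts`, `fit_cell_means`, `r_squared`) is made up for this example. Instead of Random Forest or an off-the-shelf Linear Regression it fits the mean count per (hour, weekday) cell, which is what a linear regression on fully interacted hour-by-weekday dummy variables reduces to, and then scores it on held-out weeks.

```python
import math
import random
from collections import defaultdict
from datetime import datetime, timedelta


def make_synthetic_hourly_counts(weeks=8, seed=0):
    """Fabricated hourly counts with a daily and a weekly cycle (not real data)."""
    rng = random.Random(seed)
    start = datetime(2018, 1, 1)  # a Monday
    rows = []
    for h in range(weeks * 7 * 24):
        ts = start + timedelta(hours=h)
        rate = 10 + 8 * math.sin(2 * math.pi * (ts.hour - 9) / 24)
        if ts.weekday() >= 5:  # Saturday and Sunday
            rate *= 0.7
        rows.append((ts, rate + rng.gauss(0, 1.5)))
    return rows


def fit_cell_means(rows):
    """Mean target per (hour, weekday) cell, i.e. a saturated dummy regression."""
    sums, counts = defaultdict(float), defaultdict(int)
    for ts, y in rows:
        key = (ts.hour, ts.weekday())
        sums[key] += y
        counts[key] += 1
    return {key: sums[key] / counts[key] for key in sums}


def r_squared(y_true, y_pred):
    """Coefficient of determination, 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


rows = make_synthetic_hourly_counts()
split = 6 * 7 * 24  # first six weeks for training, last two held out
train, test = rows[:split], rows[split:]

model = fit_cell_means(train)
train_mean = sum(y for _, y in train) / len(train)  # fallback for unseen cells

y_true = [y for _, y in test]
y_pred = [model.get((ts.hour, ts.weekday()), train_mean) for ts, _ in test]
r2 = r_squared(y_true, y_pred)
print(f"out-of-sample R2 on synthetic data: {r2:.2f}")
```

Splitting by time (earlier weeks for training, later weeks for testing) keeps the score out-of-sample in the sense used above. The number printed here says nothing about the 0.83 reported for the real data.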
Uploaded a preliminary analysis of 1 & 2.
Uploaded another cleaned version of the data (combined duplicate and erroneous values in the factor columns, plus other minor adjustments):
https://drive.google.com/file/d/1mK-HZ5E4_F14AJxzEmCZH0TlEABVlDlR/view?usp=sharing
Uploaded some spatial analysis results using the Global Moran's I, Local Moran's I, and hot spot analysis (Getis-Ord Gi*) methods in ArcGIS.
Here is the interpretation of the spatial method indices and results: https://docs.google.com/document/d/1Eb1rYd87xf2somXTrXmGEAu3jGRHCTBKQKtS5KjTA9k/edit?usp=sharing
https://docs.google.com/spreadsheets/d/1Ntb3kQKs7u6WLdqH2ydqpBR2M3-9o0ViwKX5goGdP-4/edit#gid=0
https://drive.google.com/file/d/1Ll87UVKchfhQgJVLm4i0uwpG14VXlkE_/view?usp=sharing
https://drive.google.com/file/d/1yfMkTj98YPVfQbCXlrmG9qCAxDkN2YlZ/view?usp=sharing
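
For readers without ArcGIS, the Global Moran's I statistic mentioned above can be sketched in a few lines. This is only an illustration under simplifying assumptions: a regular grid with binary rook-contiguity weights and toy values. ArcGIS offers other spatial weighting options and also reports a z-score and p-value, which this sketch does not compute. The function names `rook_weights` and `global_morans_i` are made up for this example.

```python
def rook_weights(n_rows, n_cols):
    """Binary rook-contiguity neighbours for a regular grid: {cell: [neighbour cells]}."""
    neighbours = {}
    for r in range(n_rows):
        for c in range(n_cols):
            candidates = [(r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)]
            neighbours[(r, c)] = [
                (i, j) for i, j in candidates if 0 <= i < n_rows and 0 <= j < n_cols
            ]
    return neighbours


def global_morans_i(values, neighbours):
    """I = (n / W) * sum_ij w_ij * z_i * z_j / sum_i z_i^2, with z_i = x_i - mean(x)."""
    n = len(values)
    mean = sum(values.values()) / n
    dev = {cell: x - mean for cell, x in values.items()}
    w_total = sum(len(nb) for nb in neighbours.values())  # each w_ij is 0 or 1
    cross = sum(dev[i] * dev[j] for i, nb in neighbours.items() for j in nb)
    denom = sum(d * d for d in dev.values())
    return (n / w_total) * cross / denom


weights = rook_weights(4, 4)

# Clustered toy pattern: high counts in the left half, low counts in the right half.
clustered = {(r, c): 1.0 if c < 2 else 0.0 for r in range(4) for c in range(4)}

# Dispersed toy pattern: a checkerboard, where every neighbour has the opposite value.
checkerboard = {(r, c): float((r + c) % 2) for r in range(4) for c in range(4)}

i_clustered = global_morans_i(clustered, weights)
i_checkerboard = global_morans_i(checkerboard, weights)
print("clustered:", round(i_clustered, 3))
print("checkerboard:", round(i_checkerboard, 3))
```

A positive value indicates that similar values sit next to each other (clustering), a negative value indicates that neighbours tend to differ (dispersion), and a value near zero is consistent with a spatially random pattern.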
I have uploaded a notebook with the regression analysis.