/ADS2018

ADS Group Project

Primary LanguageJupyter Notebook

ADS2018

ADS Group Project

Data Source: https://data.cityofnewyork.us/Public-Safety/NYPD-Motor-Vehicle-Collisions/h9gi-nx95

Cleaned and filtered by date: https://drive.google.com/open?id=1sWmIGU5ygXvGBtBuEr1rLNBEOmrk4J41

Update 12/13: Goal for Next Steps

  1. Data Wrengling:

Update 12/13 wyw238:

  1. Using Random Forest and Linear Regression successfully explain the periodicity of the accidents using hour and day of week features. Out-sampel R2 0.83

Update 11/18 wyw238:

upload a preliminary analysis of 1 & 2.
upload another cleaned version of data (combine duplicate and error values in the factor columns, and other minor adjustment) https://drive.google.com/file/d/1mK-HZ5E4_F14AJxzEmCZH0TlEABVlDlR/view?usp=sharing

Updata_Jingxi_11/20

Upload some spatial analysis results by using Global Morans1, Local Morans1 and Hot spot analysis(Getis-Ord Gi) methods in ArcGIS.
Here is the interpretation of spatial method index and results:https://docs.google.com/document/d/1Eb1rYd87xf2somXTrXmGEAu3jGRHCTBKQKtS5KjTA9k/edit?usp=sharing

Adding Categories:

https://docs.google.com/spreadsheets/d/1Ntb3kQKs7u6WLdqH2ydqpBR2M3-9o0ViwKX5goGdP-4/edit#gid=0

Contributing factors added to data:

https://drive.google.com/file/d/1Ll87UVKchfhQgJVLm4i0uwpG14VXlkE_/view?usp=sharing

Data with boroughs joined along with other data

https://drive.google.com/file/d/1yfMkTj98YPVfQbCXlrmG9qCAxDkN2YlZ/view?usp=sharing

I have uploaded notebook with regression analysis.