/US-Accidents-Data-Analysis

Analyzing US accident dataset to find out the main reasons for accidents in US.

Primary LanguageJupyter Notebook

US-Accidents-Data-Analysis

Analyzing US accident dataset to find out the main reasons for accidents in US.

This is a countrywide car accident dataset, which covers 49 states of the USA. The accident data are collected from February 2016 to Dec 2020, using multiple APIs that provide streaming traffic incident (or event) data. These APIs broadcast traffic data captured by a variety of entities, such as the US and state departments of transportation, law enforcement agencies, traffic cameras, and traffic sensors within the road-networks. Currently, there are about 1.5 million accident records in this dataset.

Acknowledgements

  • Moosavi, Sobhan, Mohammad Hossein Samavatian, Srinivasan Parthasarathy, and Rajiv Ramnath. “A Countrywide Traffic Accident Dataset.”, 2019.
  • Moosavi, Sobhan, Mohammad Hossein Samavatian, Srinivasan Parthasarathy, Radu Teodorescu, and Rajiv Ramnath. "Accident Risk Prediction based on Heterogeneous Sparse Data: New Dataset and Insights." In proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM, 2019.

Questions

We will find answers to questins like: -

  • Which city in US has reported the most number of accident cases?
  • Which 5 states reported the highest number of accident cases?
  • Which street is most accident prone in US?
  • What time of the day are accidents most frequent in?
  • Which days of the week have the most accidents?
  • What is the trend of accidents year over year (decreasing/increasing?)
  • On which side of the road most of the accidents occured?
  • How did the weather conditions affected the cases?

Dataset

Link to the US Accidents dataset: https://www.kaggle.com/sobhanmoosavi/us-accidents