/7331-machine-learning-1

Part of SMU's Master's of Data Science Program

Primary LanguageJupyter Notebook

DS7331: Machine Learning 1

Part of SMU's Master's of Data Science Program

Group 4 Team Members

  • Megan Ball
  • Amber Clark
  • Matt Farrow
  • Blake Freeman

Data Description

The data set that our group selected came the U.S. Department of Transportation's Bureau of Transportation Statistics and "tracks the on-time performance of domestic flights operated by large air carriers" during 2015 (Kaggle). The data is broken up into three separate .csv files: one with flight details for 5,819,079 flights during 2015, one with 14 different U.S.-based airlines, and one with the geographic details of 322 U.S. airports.

The data is important because it takes airline delays and cancellations, one of the aspects of airline travel that is popular fodder for public complaints, and quantifies it in a way that offers the possiblity of tangible analysis. For the purposes of this analysis, the two variables that we intend to measure are ARRIVAL_DELAY and CANCELLATION.

Projects

In-Class Assignments

  • ICA 1 (link)
  • ICA 2 (link)
  • ICA 3
  • ICA 4
  • ICA 5