/Airline-Delay-Prediction-using-Spark-and-Kylin

Building a prediction model for a huge dataset using Big Data tech like Kylin and Spark.

Primary LanguageJupyter NotebookGNU General Public License v3.0GPL-3.0

Airline-Delay-Prediction-using-Spark-and-Kylin

Mapper and Reducer purpose is to detect and replace null values by column average.

The model is built in PySpark using Decision Tree Classifiers.

Dataset