/Time-Series-Forecasting-Using-Machine-Learning-Algorithm

Sensor data of a renowned power plant has given by a reliable source to forecast some feature. Initially the work has done with KNIME software. Now the goal is to do the prediction/forecasting with machine learning. The idea is to check the result of forecast with univariate and multivariate time series data. Regression method, Statistical method.

Primary LanguageJupyter Notebook

Forecasting of different sensors' data of a power plant

This respository contains code to foecast data of numerous sensors. Data cleaning and prediction task is done step by step to perform the whole work. In develop branch all of the code will be found.

Requirements

Idea

The idea behind the task is to observe reaction of different machine learning models to the provided data from Salzgitter AG, a reputed steel industry of Germany. Provided data contains information regarding the integrated power plant of Salzgitter AG. Here, I have tried to forecast Turbine data for each minute. Provided data is in time-series format. Initially, all of the raw data is cleaned and visualized using Pandas, NumPy etc. Then, stationarity of time series is checked by ADF test. ARIMA, Linear regression, Decision Tree Regression, Neural Network, Long Short Term Memory(LSTM) are used to do the forecasting.

Useful links for theoretical knowledge

Code to execute

  • After cloning this repository you have to set the name of the csv file where your data is stored. For doing this just take a look in this configuration file
  • In main.py replace the variable name. If this line is not found just try to find dataframe read variable name and use that to read your csv file
  • This file is used for data preprocessing
  • All of the machine learning model is demonstrated here
  • To run code in your favourite IDE/ terminal execute python main.py