/Hourly_Energy_Consumption_Prediction

This repo contains files and jupyter notebooks for the project- Predicting energy consumption of the entire region in southern CA served by the SDGE (San Diego Gas and electric) utility based on the past 5 years of hourly energy consumption data.

Primary LanguageJupyter Notebook

Predicting hourly energy consumption of San Diego, CA, US

This repo contains files and jupyter notebooks related to the above project. This project focuses on predicting energy consumption of the entire region in southern CA served by the SDGE (San Diego Gas and electric) utility (which comes under CAISO) based on the past 5 years of hourly energy consumption data.

File descriptions:

Contains the initial project proposal

Contains the final project report from data import and EDA to ML

A single notebook file including data import of energy, weather and PV installation data files, statistical analysis, EDA, finding trends among energy, weather and PV installation capacity data. Preparing the data for ML.
NBviewer link for the EDA notebook

Contains the cleaned and merged data with energy time series, added time components, temperature and PV installations. This was used in the ML models.

A single notebook file including all the ML models that were tried on the dataset. Different models including elastic net, random forest, SARIMAX, FB Prophet, XGBoost, XGBoost+FB Prophet were tried with some feature engineering techniques. These models were compared to the baseline forecast which simply repeats the past values.
NBviewer link for the ML notebook

Presentation slide deck for the entire project.


California map showing different electrical utility regions including the San Diego Gas and Electric (SDGE) which was the focus of this project.

Google earth kmz file showing locations of weather stations in US

PV installation dataset key to understand all the columns. Actual PV installations data couldn't be uploaded because it is >50MB. But the link of the data is given in the SDGE_energy_EDA.ipynb in its PV section.

Hourly energy consumption data in MWH for all the 4 utilities of CA.

Hourly energy consumption data in MWH for only SDGE.

NOAA weather data from 2014-18 for SDGE region (data of two stations including the SDGE airport which was used in this project).