Data-Science-and-Openshift-on-the-MOC: A Jupyter Notebook repository from BU-CLOUD-S20

Data Science and Openshift on the MOC

Pull request accepted by RedHat into main repo on 06/11/2020.

Click here for a comprehensive Project Report

Our GitHub repo containing our Prometheus Anomaly Detector can be found here

The Final presentation video for our project can be found here.

The slides for this presentation can be found here.

Understanding the current ML models (SARIMA and Prophet) and workings of Prometheus Anomaly Detection (PAD), an existing open-source project developed by the AI team at RedHat to analyze time-series data generated by cloud infrastructure
Developing and deploying a new ML model (LSTM) onto DataHub, a blueprint for building an AI application as a service platform on the OpenShift Container Platform.
Testing the workings of DataHub on the MOC and reporting any problems for future ramifications.

The open-source community
MOC users that want to analyze time-series data generated by cloud infrastructure and send alerts for potential anomalies
Site Reliability Engineers that would like to monitor the health of their applications

Ensuring the deployment and functionality of DataHub on the MOC
Extension of the current Prometheus Anomaly Detection tool with a new model(s)
Exploring/reporting any issues and bugs we encounter along with the project for future ramifications

Developing at least one new ML model (LSTM) and having it monitor any metric while running on DataHub.

Project familiarization (02/05)
Deploy DataHub onto MOC (03/01)
Set up Data Hub on Openshift (03/15)
Extend Prometheus with a new model(s) using Jupyter notebooks (04/15)
Test the models on Prometheus using data provided by the mentor (05/01)
Migrate built ML Algorithms and models onto DataHub (05/05)