Table of Contents

About the project

This project is built to demonstrate Google Cloud Platform ETL capabilities, creating machine learning models to detect anomalies and visualizing obtained results.

Dataflow diagram

Dataflow diagram

Outliers visualization

Outliers visualization

Anomaly Detection approach

Anomalies are detected using K-Means clustering algorithm implemented on BigQuery ML service. Details could be found there: BigQuery k-means tutorial