ayunimatulf/anomalydetection

Jupyter Notebook

Anomaly Detection

Overview

Detect anomaly based on sensor data from NASA Acoustics and Vibration Database.

EDA

Sensor data from 2014-02-13 till 2014-02-19 with total 982 records. Data line plot and box plot.

From above charts seems Bearing 1 has a lot value higher than the normal value.

Method

Here I tried 2 methods below:

Principal Components Analysis (PCA)

Transform the record from 4 points into 2 points

Check what is the distance of the record and the centroid of overall data using simple euclidean distance
Decide the threshold based on euclidiance data distribution

The yellow one is data that flagged as anomaly and it quite distinctfull compare to other points.

Autoencoder

Define the model architecture and fit the data with X = features and y = X
Check the mean loss from the network output and the original data values
Check the distribution of loss values to define the threshold (9n here I used 0.04 as the threshold)

Result Visualization :

The result both between PCA and autoencoder is quite same, but if we see the image below seems PCA way alot faster compare to the Autoencoder.