/anomaly_detection

Learning Repo for anomaly detection

Primary LanguageJupyter Notebook

Intro to Anomaly Detection

A repo to explore and apply anomaly detection in spark

Sections

0 Getting Started

  • Testing out basic data munging in Spark
  • Testing out window functions / UDFs and UDAFs

1 probabilistic and statistical methods

  • Probabilistic Tail Inequalities

Requirements

Jupyter docker all-spark stack

Testing Data

  • s3 access logs

Quick Notes


docker run -d -it -p 8888:8888 -v /home/brian/Workspace/anomaly_detect:/home/jovyan/work/ jupyter/all-spark-notebook:latest