Intro to Anomaly Detection
A repo to explore and apply anomaly detection in spark
Sections
0 Getting Started
- Testing out basic data munging in Spark
- Testing out window functions / UDFs and UDAFs
1 probabilistic and statistical methods
- Probabilistic Tail Inequalities
Requirements
Jupyter docker all-spark stack
Testing Data
- s3 access logs
Quick Notes
docker run -d -it -p 8888:8888 -v /home/brian/Workspace/anomaly_detect:/home/jovyan/work/ jupyter/all-spark-notebook:latest