Slides from BSides Augusta 2023
This repository contains a full example of the data processing cycle, covering:
- Reading, cleaning, and pre-processing raw data,
- Exploratory Data Analysis (EDA) with statistical and graphical techniques,
- Feature engineering for numeric and categorical features,
- Model training and evaluation.
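The four stages above can be sketched end to end in a few lines. This is only an illustrative outline and is not taken from the notebook: it uses just the Python standard library, and the toy CSV, the column names (`duration`, `protocol`, `label`), and the single-threshold classifier are all hypothetical stand-ins chosen to keep the example small.

```python
import csv
import io
import statistics

# Hypothetical raw data: untidy whitespace, mixed case, and one missing value.
RAW_CSV = """duration,protocol,label
1.0,tcp,0
2.0, UDP,0
,tcp,0
9.0,tcp,1
8.0,udp,1
1.5,udp,0
7.5,tcp,1
1.2,udp,0
"""

def read_and_clean(text):
    """Read CSV rows, normalize the category text, impute missing durations with the median."""
    rows = list(csv.DictReader(io.StringIO(text)))
    for row in rows:
        row["protocol"] = row["protocol"].strip().lower()
        row["label"] = int(row["label"])
        raw = row["duration"].strip()
        row["duration"] = float(raw) if raw else None
    median = statistics.median(r["duration"] for r in rows if r["duration"] is not None)
    for row in rows:
        if row["duration"] is None:
            row["duration"] = median
    return rows

def summarize(rows):
    """EDA step: basic summary statistics for the numeric column."""
    values = [r["duration"] for r in rows]
    return {"mean": statistics.mean(values), "min": min(values), "max": max(values)}

def fit_encoder(rows):
    """Learn scaling statistics and the category list from the training rows only."""
    values = [r["duration"] for r in rows]
    return {
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),
        "categories": sorted({r["protocol"] for r in rows}),
    }

def transform(rows, enc):
    """Feature engineering: standardize the numeric feature, one-hot encode the categorical one."""
    X = []
    for r in rows:
        scaled = (r["duration"] - enc["mean"]) / enc["stdev"]
        one_hot = [1.0 if r["protocol"] == c else 0.0 for c in enc["categories"]]
        X.append([scaled] + one_hot)
    y = [r["label"] for r in rows]
    return X, y

def accuracy(threshold, X, y):
    """Fraction of rows where 'scaled duration >= threshold' matches the label."""
    preds = [1 if x[0] >= threshold else 0 for x in X]
    return sum(p == t for p, t in zip(preds, y)) / len(y)

def train_stump(X, y):
    """Pick the midpoint threshold on the scaled duration with the best training accuracy."""
    values = sorted({x[0] for x in X})
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])]
    return max(candidates, key=lambda t: accuracy(t, X, y))

rows = read_and_clean(RAW_CSV)
train_rows, test_rows = rows[:6], rows[6:]   # simple hold-out split
print("EDA summary:", summarize(train_rows))

encoder = fit_encoder(train_rows)
X_train, y_train = transform(train_rows, encoder)
X_test, y_test = transform(test_rows, encoder)

threshold = train_stump(X_train, y_train)
print("held-out accuracy:", accuracy(threshold, X_test, y_test))
```

Fitting the scaling statistics on the training rows and reusing them for the held-out rows keeps the evaluation from seeing test data during feature engineering; a real project would typically swap the threshold rule for a proper model.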
The contents of the repo are:
- Code: Jupyter notebook named `bsides_aug.ipynb`,
- Data: included in the `data` directory,
- Packages: listed in `requirements.txt`. They can be installed using `pip install -r requirements.txt`.
You can follow the instructions in vs-code-ml if you want to set up a development environment using VS Code and Jupyter. The Jupyter notebook should also run in Google Colab, although this has not been tested.