Pandas data processing Jupyter notebook for blog post. The datasets are taken from the Kitsune Network Attack Dataset. You will need to download the files: OS_Scan.pcap, SYN_Dos.pcap
and export these to csv. The Mirai csv is already included, the other two files were too large to be in the repo.
All the Python dependencies can be installed using poetry. Follow the steps below:
-
Install dependencies using:
poetry install
. -
Switch to the poetry virtual environment:
poetry shell
. -
Run:
which python
and copy the full path of the Python interpreter to.vscode/settings.json
substituting the variable:"python.pythonPath": <full_path_to_python_env_var>
-
Change the Jupyter notebook kernel to the poetry venv Python path.
Note: The OS_Scan_dataset.csv
is not included in the data because of the file size (5 GB). You will need to download the pcap from Kitsune Network Attack Dataset, add two columnes for source and destination port, and export the pcap to a csv file.