Can a Neural Net learn to differentiate between user tasks by watching the Kernel Trace? Does this accuracy define a metric of how reverse-engineerable a system is? 🤩
How to run
1️⃣ To start collecting data, run the following:
sudo ./collect.sh
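For readers curious what the collection step involves under the hood, here is a minimal sketch of the standard tracefs interaction that a script like collect.sh presumably wraps (this is not the repo's script; on older kernels the mount point is /sys/kernel/debug/tracing):

```python
# Minimal sketch of a function_graph capture via tracefs (illustrative only,
# not the repo's collect.sh). Requires root.
import time

TRACEFS = "/sys/kernel/tracing"  # older kernels: /sys/kernel/debug/tracing

def write_knob(knob, value):
    with open(f"{TRACEFS}/{knob}", "w") as f:
        f.write(value)

write_knob("current_tracer", "function_graph")  # select the graph tracer
write_knob("tracing_on", "1")                   # start recording
time.sleep(20)                                  # capture ~20 s, as in the repo's data
write_knob("tracing_on", "0")                   # stop recording

with open(f"{TRACEFS}/trace") as f:             # snapshot of the per-CPU buffers
    raw_trace = f.read()
```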
2️⃣ To process the data, run the following:
./process.sh
Profiling shows that over 90% of the time is spent in build_data and parser in pythonscripts/Processor3.py, so an algorithm in C/FORTRAN to convert YAMLs to adjacency lists is a great way to contribute :-)
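To make that bottleneck concrete, below is a rough Python sketch of the conversion a faster C/FORTRAN port would need to reproduce; the YAML schema and key names ('name', 'children') are assumptions for illustration, not the repo's actual format:

```python
# Illustrative only: turn one YAML'd call tree into an adjacency list.
# The schema ({'name': ..., 'children': [...]}) is an assumption.
import yaml  # pip install pyyaml

def to_adjacency(node, adj=None):
    """Depth-first walk: record each node's children under its name."""
    if adj is None:
        adj = {}
    children = node.get("children") or []
    adj.setdefault(node["name"], []).extend(c["name"] for c in children)
    for child in children:
        to_adjacency(child, adj)
    return adj

with open("trace_core0.yaml") as f:  # hypothetical file name
    tree = yaml.safe_load(f)
print(to_adjacency(tree))
```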
3️⃣ To train the neural net and check the results, run:
./trainAI.sh
However, it is more convenient to take advantage of Google Colab's GPU by using the following notebook, which also carries the latest version of the network architecture:
./utils/TraceMeOut_4GoogleColab.ipynb
Configuration
The file configuration.py allows modifying the neural net's training and architecture hyperparameters, as well as the size of the sliding window and the sampling ratio. When training in Colab, the equivalent of configuration.py is a Jupyter Notebook cell. For more information about what each parameter means, see "Model".
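For illustration, such a file (or the equivalent Colab cell) might look like the sketch below; the parameter names are assumptions based on the description above, not the repo's actual identifiers:

```python
# Hypothetical configuration.py values, for illustration only.
WINDOW_SIZE    = 1000        # trace samples per sliding window
SAMPLING_RATIO = 0.5         # fraction of windows kept when building the dataset
BATCH_SIZE     = 64          # training hyperparameters
EPOCHS         = 30
LEARNING_RATE  = 1e-3
HIDDEN_UNITS   = [128, 64]   # architecture: layer widths of the classifier head
```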
Model
The sampling and neural network model are as follows. In particular, the pipeline requires collecting data first and then processing it, as opposed to training the neural net online. It also does not support predicting over a live kernel stream; it is essentially an offline experiment.
Results
The following are results for the simple case of a binary classification task: listening to music on YouTube vs. having the video paused. The database for this case is composed of the traces of 4 CPU cores, currently part of the repo in the directory processed_trace. It contains 20-30 seconds of trace data sampled every 100 microseconds, which yields databases of roughly 5 MB per core. If using Google Colab, which is recommended as it carries the latest version, the data is automatically curled as part of the code, and about 4 GB of TensorFlow TFRecord dataset files are produced.
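As a hedged example of working with those files, the shards can be inspected with TensorFlow's standard tf.data tooling; the file pattern below is illustrative, not the repo's actual layout:

```python
# Illustrative only: peek at one serialized example from the TFRecord shards.
import tensorflow as tf

files = tf.data.Dataset.list_files("processed_trace/*.tfrecord")  # assumed pattern
raw = tf.data.TFRecordDataset(files)

for record in raw.take(1):
    example = tf.train.Example()
    example.ParseFromString(record.numpy())
    print(example)  # dumps the feature map of one window
```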
The conclusion for this case is that the network was indeed able to learn something, as indicated by the upward trend in accuracy over epochs and the relatively small amount of overfitting, measured as the gap between training and validation accuracy.
Neural Network Architecture
The following diagram shows the chosen neural network architecture for a 4-processor machine, which corresponds to the machine on which the data included in this repo was collected.
Tree Encoding
Ftrace's function_graph defines a graph. The following diagram shows how it is encoded into integer-valued tensors, which are later 'normalized' to the [0, 1] range as part of the neural net's preprocessing.
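As a rough illustration of that idea (the repo's exact scheme may differ), one could map each traced function to an integer id, keep the call depth implied by function_graph's indentation, and scale both into [0, 1]:

```python
# Illustrative encoding only; vocabulary, window size, and max depth are assumed.
import numpy as np

VOCAB = {"<pad>": 0, "do_syscall_64": 1, "sys_read": 2, "sys_write": 3}
MAX_DEPTH = 16  # assumed cap on call-graph depth

def encode(calls, window=8):
    """calls: list of (function_name, depth) pairs from one trace window."""
    ids = [(VOCAB.get(name, 0), depth) for name, depth in calls][:window]
    ids += [(0, 0)] * (window - len(ids))                   # pad to a fixed window
    t = np.array(ids, dtype=np.float32)                     # integer-valued tensor
    return t / np.array([max(VOCAB.values()), MAX_DEPTH])   # scale into [0, 1]

x = encode([("do_syscall_64", 0), ("sys_read", 1)])
print(x.shape, x.min(), x.max())
```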
Requirements
- A Linux kernel equipped with Ftrace and function_graph (e.g. Ubuntu 18 and 20)
- Python 3 with TensorFlow (2.5 or later), NumPy, and Pandas
- Last but not least, the hypothesis that the kernel trace contains 'some' information from which a neural net can learn 😉