There are three major parts:
- Disjunctive Normal Form (DNF) formulas generation (rules generation)
- Parity feature generation and selection
- Testing with multiple classifiers
This phase takes a raw data file in .arff format and generate a set of rules in .csv format.
This phase take DNF rules (.csv) and dataset with raw features (.csv) to generate and select best parity features.
The code is written in python with Google Colab notebook. You can also run it locally by downloading the .ipynb file.
In this phase, we compare the performance of raw features and the parity features using several classifiers. The code is written in Java, using the WEKA JAVA API.
Run ‘wekaClassificationTest.java’