Data_processing

This was created for my thesis research, to process the data efficiently as input for machine learning algorithms.

Primary language: Python

Data is collected in CSV format for each analysis (malware/clean).

Two folders:
- Clean
- Malware

Each "xlsx" file contains 19 unique parameters.

Each folder contains ~1000 "xlsx" files.

The program:
- converts each file to CSV
- writes each parameter to its own CSV file
- normalizes the data for each parameter
- combines the normalized clean and malware analysis data into one file for processing
- arranges all files in specific folders
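The normalize-and-combine steps can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code. It assumes the "xlsx" files have already been converted to CSV with a header row and numeric columns, and it assumes min-max scaling applied per file, since the normalization method is not stated here. The function names and the output columns (`source_file`, `label`, `value`) are hypothetical.

```python
import csv
from pathlib import Path


def read_table(csv_path):
    """Read one converted CSV file into a dict of column name -> list of floats."""
    with open(csv_path, newline="") as handle:
        rows = list(csv.DictReader(handle))
    if not rows:
        return {}
    return {name: [float(row[name]) for row in rows] for name in rows[0]}


def min_max_normalize(values):
    """Scale values to [0, 1] (assumed method); a constant column maps to zeros."""
    low, high = min(values), max(values)
    if high == low:
        return [0.0 for _ in values]
    return [(v - low) / (high - low) for v in values]


def combine_parameter(clean_files, malware_files, parameter, out_path):
    """Write one CSV with the normalized values of `parameter` from both classes."""
    out_path = Path(out_path)
    # Create the destination folder if it does not exist yet.
    out_path.parent.mkdir(parents=True, exist_ok=True)
    with open(out_path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["source_file", "label", "value"])
        for label, files in (("clean", clean_files), ("malware", malware_files)):
            for csv_path in files:
                column = read_table(csv_path).get(parameter, [])
                if not column:
                    continue  # skip files that lack this parameter or are empty
                for value in min_max_normalize(column):
                    writer.writerow([Path(csv_path).name, label, value])
```

Calling `combine_parameter` once per parameter, with the lists of converted files from the Clean and Malware folders, would yield one labeled file per parameter. Normalizing across the whole dataset instead of per file is an equally plausible reading and would only change where `min_max_normalize` is applied.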