/ML_FeatureAnalysis

Primary LanguagePythonMIT LicenseMIT

ML_FeatureAnalysis

This research project aims to create a tool that would aid researchers who are not familliar with machine learning, in applying machine learning methods within their research. The researcher would only have to provide the initial data, and the various modules are meant to aid in data pre-processing, applying models, and analyzing the results. For physics simulations the sensitivities of the output calculated with respect to the inputs have been shown to be good machine learning features. The predicted quantity for the machine learning algorithm can be any quantity of interest in the physics simulation, so that the machine learning algorithm can begin to learn relationships of interest.

Once machine learning has been performed on a data set, feature ranking can be calculated to find the most important features for predicting the quantity of interest to the machine learning algotihm. This can provide unique insights about the problem space. An example applying this methodology is currently in review for publication. In that study the features are nuclear cross section sensitivities calculated with respect to k (the output of the physics simulation), and the predicted value is the error of the simulation. Finding the important features to the machine learning model now shows what cross sections are important to predicting error, which can be used to help identify sources of error in the problem space.