MLCCI-Experiment

The Spectrum-Based Fault Localization (SBFL) technique is widely used for identifying and pinpointing the source of bugs in software. Despite ongoing research efforts to improve SBFL techniques, they can still be hindered by the presence of Coincidental Correct (CC) test cases in test suites. These test cases can negatively impact the accuracy of SBFL. To address this issue, we propose a new approach, the Machine Learning-based CC Test Case Identification (MLCCI), which utilizes multiple features extracted from the program under test to identify and eliminate CC test cases.

Runtime environment

python 3.10

Executing the MCLLI method experimental code

Executing the main function of featureExtract.py, obtain the results of four feature calculations.
Executing the main function of MeRF_leave_one.py, calculating CC recognition results, including Recall, Precision, FPR and F1-score. In addition, obtaining the random forest's training model.
Executing the main function in Location.py to calculate the list of suspicious statements.
Executing the main function in FaultMe.py to get the metrics of fault localization which contains Wasted Effort and Accuracy@N.

CC test cases identification effectiveness

	recall	precision	Fmeasure
Chart	70.68%	76.45%	61.43%
Closure	56.41%	65.97%	40.60%
Lang	82.34%	70.97%	68.25%
Math	72.15%	80.35%	63.75%
Mockito	53.90%	65.10%	36.93%
Time	43.12%	70.93%	40.67%

Fault Localization effectiveness

	sus_formula	top-5	top-3	top-1
Chart	jaccard_Relabeling	21	18	17
Closure	jaccard_Relabeling	112	107	102
Lang	jaccard_Relabeling	48	37	35
Math	jaccard_Relabeling	139	137	132
Mockito	jaccard_Relabeling	10	10	8
Time	jaccard_Relabeling	23	23	15
total	jaccard_Relabeling	353	332	309

runexperiment/MLCCI-Experiment

MLCCI-Experiment

Runtime environment

Executing the MCLLI method experimental code

CC test cases identification effectiveness

Fault Localization effectiveness