/The_dataset_of_large_case_studies_on_mutants_similarity_with_bugs

The dataset of large case studies on mutants similarity, measured both semantically and syntactically, with bugs

Primary LanguageTeXApache License 2.0Apache-2.0

The dataset of large case studies on mutants' similarity with bugs

This repository provides access to the dataset associated with our accepted papers on fault seeding and evaluation of mutation testing techniques. The dataset is designed to support empirical studies, offering insights into the syntactic and semantic aspects of artificially seeded faults a.k.a. mutants, generated by mutation testing tools (PIT, μBERT, iBIR, DeepMutation) and bugs (Defects4J).

Papers

Deep Learning Approach

The source code for the deep learning approach, Cerebro, employed for mutant selection is available here.

Simulation

The source code to perform simulation and to help identify the semantic and syntactic correlation between bugs and mutants is available in Simutate repository.

Citation

To cite our papers or the dataset, please use the BibTeX entries available in cite.bib.

Structure of the Dataset

Important Note: Due to GitHub's restriction on file sizes, the dataset files are zipped to a maximum of 100 MB each.

The dataset is organized as follows:

Please feel free to explore and utilize the dataset for your research and testing evaluations. If you have any questions or need further clarification, please feel free to reach out. Thank you for your interest and collaboration.