This is an open-science repository with the data and tooling needed to replicate our large-scale study of bloated dependencies in the Maven ecosystem. Data collection relies on our tool, DepClean.
The following sections describe the main materials included in this repository.
The data-collector tool mines the Maven Central repository to obtain usage information about artifacts and their dependencies. The analysis relies on fine-grained static bytecode analysis with DepClean. The tool also uses dedicated Maven plugins to collect data about the projects' architecture and their dependencies.
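As a rough illustration of the idea behind the usage analysis, the sketch below flags a dependency as bloated when none of its classes are referenced by the project. This is a simplified assumption for explanatory purposes, not DepClean's actual implementation: the function name `find_bloated`, the artifact coordinates, and the class names are all hypothetical, and the real analysis works on bytecode rather than on precomputed sets.

```python
from typing import Dict, Set


def find_bloated(declared: Dict[str, Set[str]], referenced: Set[str]) -> Set[str]:
    """Return the dependencies that contribute no referenced class.

    declared   maps a dependency (hypothetical Maven coordinates) to the
               set of class names it provides.
    referenced is the set of class names the project actually uses, as a
               bytecode analysis might report them (assumed input).
    """
    return {
        dep
        for dep, classes in declared.items()
        if not (classes & referenced)  # no overlap means no usage
    }


# Hypothetical example data, not taken from the study.
declared = {
    "org.example:lib-a:1.0": {"org.example.a.Foo", "org.example.a.Bar"},
    "org.example:lib-b:2.1": {"org.example.b.Baz"},
}
referenced = {"org.example.a.Foo", "java.lang.String"}

bloated = find_bloated(declared, referenced)
print(sorted(bloated))  # ['org.example:lib-b:2.1']
```

Here `lib-a` is kept because one of its classes is referenced, while `lib-b` is reported because none of its classes are.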
The notebooks folder contains a set of R scripts supporting the analysis of the data collected with the data-collector tool.
The contributions folder contains a list of bloated dependencies found in open-source projects with the help of DepClean. These bloated dependencies were reported directly to the development teams through pull requests on GitHub. The folder also links to the original code repositories and records the actions taken by the developers.
Distributed under the MIT License. See LICENSE for more information.