source{d} datasets for source code analysis and machine learning on source code.
This repository contains the tools and scripts needed to reproduce the datasets.
List of available datasets:
Contributions are very welcome; please see CONTRIBUTING.md and the code of conduct.
The tools and scripts are licensed under Apache 2.0; see LICENSE.md.