/datasets

source{d} datasets ("big code") for source code analysis and machine learning on source code

Primary LanguageJupyter NotebookOtherNOASSERTION

datasets Build Status Build status

source{d} datasets for source code analysis and machine learning on source code.

This repository contains all the needed tools and scripts to reproduce the datasets.

List of available datasets:

Contributions

Contributions are very welcome, please see CONTRIBUTING.md and code of conduct.

License

The tools and scripts are licensed under Apache 2.0, see LICENSE.md.