- Install srcml from www.srcml.org/
- Install all python dependencies by
pip install -r requirements.txt
Download the dataset of each year at github.com/rafed123/gcj-dataset. Install srcml and run code_to_data.ipynb to convert the data to csv.
The results of analysis can be found in Jupyter notebooks gcj_analysis_*.ipynb