/SoftwareTagRecommender

A tag recommender based on SOTA machine learning algorithms to automatically recommending tags to software repositories.

Primary LanguageJupyter NotebookCreative Commons Attribution 4.0 InternationalCC-BY-4.0

Software Tag Recommender

This repository contains all our exploration on the use of machine learning methods to automatically recommend tags for software repositories.

It currently contains the following folders:

  • data-preparation: Our scripts for pre-processing the data, the data containing rules for preprocessing tags and generating sub-topics. Also, we include the final dataset of preprocessed sub-topics along with their featured Github topics and set of aliases.

  • machine-learning: Our python scripts for training various machine-learning-based algorithms.

Paper and appendix

The paper can be found on ArXiv: link