RSC-framework

Resume summarization and classification framework

GNU General Public License v3.0 (GPL-3.0)

Data

The multilabel resume dataset was obtained from Kaggle. The dataset in this project is a cleaned subset of the original. Extractive summaries of the resumes were created with the Bert Extractive Summarizer package and are stored in rscdata-X-summary.json and rscdata-X-test-summary.json, one for each of the corresponding resume data files. Targets are one-hot encoded and stored in the rscdata-y.json and rscdata-y-test.json data files.
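The target encoding described above can be illustrated with a minimal sketch. The internal layout of rscdata-y.json is not documented here, so this assumes a JSON list with one binary row per resume and one column per class. The function name encode_targets and the class names are hypothetical and are not taken from the dataset.

```python
import json


def encode_targets(label_sets, classes):
    """Return one binary indicator row per resume, with columns ordered like `classes`."""
    index = {name: i for i, name in enumerate(classes)}
    rows = []
    for labels in label_sets:
        row = [0] * len(classes)
        for name in labels:
            # Set the column of every label attached to this resume.
            row[index[name]] = 1
        rows.append(row)
    return rows


# Hypothetical class names and label sets, for illustration only.
classes = ["Data_Scientist", "Software_Developer", "Web_Developer"]
label_sets = [["Software_Developer", "Web_Developer"], ["Data_Scientist"]]

y = encode_targets(label_sets, classes)

# Round-trip through JSON, assuming the targets are stored as a list of rows.
serialized = json.dumps(y)
restored = json.loads(serialized)
```

Because a resume can carry several labels, a row may contain more than one 1, which is why each row is built per label set rather than per single class.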

MSc. Data Science, University of London, Final Project