This module converts PDF resumes to text, splits each resume into its main sections, extracts pertinent information, and cleans the data. The results are then aggregated and saved in a summary CSV.
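As a rough illustration of the sectioning, extraction, and aggregation steps described above, here is a minimal sketch. It is not the module's actual code: the heading list, the function names (`split_into_sections`, `extract_email`, `summarize`), and the output columns are all assumptions made for the example, and the PDF-to-text step is left out, so the input is already plain text.

```python
import csv
import io
import re

# Hypothetical section headings; the real module may recognize a different set.
SECTION_HEADINGS = ("experience", "education", "skills")

# Deliberately simple email pattern, used for illustration only.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")


def split_into_sections(text):
    """Group non-empty lines under the most recent section heading seen."""
    sections = {"header": []}
    current = "header"
    for line in text.splitlines():
        stripped = line.strip()
        key = stripped.lower().rstrip(":")
        if key in SECTION_HEADINGS:
            current = key
            sections.setdefault(current, [])
        elif stripped:
            sections[current].append(stripped)
    return {name: " ".join(lines) for name, lines in sections.items()}


def extract_email(text):
    """Return the first email-like string in the text, or None."""
    match = EMAIL_RE.search(text)
    return match.group(0) if match else None


def summarize(resumes):
    """Build CSV text with one row per resume (mapping of file name to text)."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["file", "email", "skills"])
    writer.writeheader()
    for name, text in resumes.items():
        sections = split_into_sections(text)
        writer.writerow({
            "file": name,
            "email": extract_email(text),
            "skills": sections.get("skills", ""),
        })
    return buffer.getvalue()
```

Calling `summarize({"a.pdf": resume_text})` returns the CSV contents as a string, which could then be written to a file of your choice.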
# Install requirements
pip install -r requirements.txt
# Download the English language model for spaCy
python -m spacy download en
# Run code (with default configurations)
cd bin/
python main.py
# Review output
open ../data/output/resume_summary.csv
bin/main.py
: Code entry point
confs/confs.yaml.template
: Configuration file template
data/input/resumes
: Input resumes
data/output/resume_summary.csv
: Results from parsing example resumes
data/output/resume_sections.csv
: Resumes split into sections (one column per section)
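
To inspect either output file programmatically instead of opening it by hand, something like the following sketch may help. It uses only the standard library and makes no assumption about which columns the CSV contains; the `load_rows` helper is a hypothetical name, and the path assumes you run it from the repository root.

```python
import csv


def load_rows(path):
    """Read a CSV file into a list of dicts keyed by the header row."""
    with open(path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


if __name__ == "__main__":
    # Path taken from the file listing above; adjust it if you run from bin/.
    rows = load_rows("data/output/resume_summary.csv")
    print(f"{len(rows)} resumes parsed")
    if rows:
        print("columns:", ", ".join(rows[0].keys()))
```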