- First, we vectorize the text so that it can go through NLP (Natural Language Processing).
- For this we use the open-source software library spaCy.
- We used the TF-IDF (Term Frequency-Inverse Document Frequency) algorithm to score each resume.
- We then sort the scores to show the top candidates.
- For word2vec, we used the gensim model together with NumPy.
- We used the Stack Exchange data set, Stack Overflow's own open data set, which is around 77 GB.
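
The TF-IDF scoring and sorting steps above can be sketched roughly as follows. This is a minimal illustration in plain Python, not the project's actual code: it assumes each resume is scored by cosine similarity against a job description (the notes above do not say what resumes are compared to), and the names `tokenize`, `tfidf_vectors`, `cosine`, and `rank_resumes`, as well as the sample data, are hypothetical. A real pipeline would likely tokenize with spaCy rather than a regular expression.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Simplified stand-in for a real tokenizer: lowercase alphanumeric runs.
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    # Build one sparse TF-IDF vector (dict: term -> weight) per document.
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    # Smoothed IDF (an assumption; other variants exist).
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in df.items()}
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        total = len(tokens) or 1
        vectors.append({t: (c / total) * idf[t] for t, c in tf.items()})
    return vectors


def cosine(a, b):
    # Cosine similarity between two sparse vectors.
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_resumes(job_description, resumes):
    # Score every resume against the job description, highest score first.
    names = list(resumes)
    vectors = tfidf_vectors([job_description] + [resumes[n] for n in names])
    job_vec = vectors[0]
    scores = [(name, cosine(job_vec, vec)) for name, vec in zip(names, vectors[1:])]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    sample_resumes = {
        "alice": "python developer with nlp and spacy experience",
        "bob": "accountant with excel and bookkeeping experience",
        "carol": "java engineer",
    }
    job = "python nlp engineer with spacy experience"
    for name, score in rank_resumes(job, sample_resumes):
        print(f"{name}: {score:.3f}")
```

Sorting with `sorted(..., reverse=True)` puts the highest-scoring resume first, so the top candidates are simply the first few entries of the returned list.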