- Use PyPDF2 library to extract text from .pdf file
- Create a dictionary with counter for each identified word
- Filter common stop-words (based on context)
- Load data into Dataframe
- Visualize wordcloud from Dataframe, learnt from this Datacamp tutorial
Insights of commonly used words in the document represented through a word cloud
Jupyter NotebookMIT