- Implement TF-IDF algorithm to scan documents and point out the most descriptive words of documents.
- Tools: PySpark Session, PySpark ML Package, Jupyter Notebook.
- Data File: Will be uploaded later.
Report of the project: https://www.notion.so/Term-Frequency-Inverse-Document-Frequency-16bb0c62ead045f18cdbc40dd05c07a1