teomandi/DIT-BigDataAnalytics
An assignment for the Big Data Analytics course in order to familiarize in the following Big Data applications: Text Classifications (and WordClouds), DeDuplication (with LSH and Machine Learning techniques using feature engineering) and last Sentiment Analysis. For more information read the assignment and my report.
Jupyter Notebook