/DIT-BigDataAnalytics

An assignment for the Big Data Analytics course, intended to build familiarity with the following Big Data applications: text classification (with word clouds), deduplication (using LSH and machine learning techniques with feature engineering), and sentiment analysis. For more information, read the assignment and my report.

Primary Language: Jupyter Notebook

This repository is not active