
Algorithms for Massive Datasets

Primary language: Jupyter Notebook. License: BSD 3-Clause "New" or "Revised" License (BSD-3-Clause).

market_basket_IMDB

Algorithms for Massive Datasets project a.y. 2020/2021
Data Science and Economics

This project presents an analysis of the Kaggle IMDB dataset provided by ashirwadsangwan.
The objective was to implement a scalable market basket analysis, treating movies as baskets and actors as items, in order to extract association rules between the items.
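
To make the basket/item framing concrete, here is a minimal plain-Python sketch of the idea: it counts frequent actor pairs with a two-pass, A-priori-style filter and derives association rules from them. It is not the notebook's actual Spark implementation. The toy `baskets` data, the function name `frequent_pairs_and_rules`, and the `min_support` and `min_confidence` thresholds are all hypothetical, chosen only for illustration.

```python
from collections import Counter
from itertools import combinations

# Toy data (hypothetical): each basket is a movie, each item is an actor.
baskets = {
    "movie_1": {"actor_A", "actor_B", "actor_C"},
    "movie_2": {"actor_A", "actor_B"},
    "movie_3": {"actor_A", "actor_C"},
    "movie_4": {"actor_B", "actor_D"},
}

def frequent_pairs_and_rules(baskets, min_support=2, min_confidence=0.6):
    """Return frequent actor pairs and the rules x -> y derived from them."""
    # Pass 1: count how many movies each actor appears in.
    item_counts = Counter(item for items in baskets.values() for item in items)
    frequent_items = {i for i, c in item_counts.items() if c >= min_support}

    # Pass 2: count only pairs whose members are both frequent
    # (a pair cannot be frequent unless both of its items are).
    pair_counts = Counter()
    for items in baskets.values():
        kept = sorted(items & frequent_items)
        pair_counts.update(combinations(kept, 2))
    frequent_pairs = {p: c for p, c in pair_counts.items() if c >= min_support}

    # Rule x -> y has confidence support(x, y) / support(x).
    rules = []
    for (a, b), count in frequent_pairs.items():
        for x, y in ((a, b), (b, a)):
            confidence = count / item_counts[x]
            if confidence >= min_confidence:
                rules.append((x, y, confidence))
    return frequent_pairs, sorted(rules)

frequent_pairs, rules = frequent_pairs_and_rules(baskets)
for x, y, confidence in rules:
    print(f"{x} -> {y} (confidence {confidence:.2f})")
```

In this sketch support is an absolute count of movies rather than a fraction. A distributed version would express the same two counting passes as operations over a partitioned dataset.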

The project was implemented using Apache Spark on Google Colaboratory.