Apache Spark Analytics Examples
MOVIELENS DATASET BASED MOVIES RECOMMENDATION
- Dataset: ml-20m
- Data size (Ratings: 20000263, Tags: 465564, Movies: 27278, Users: 138493)
- Data files (genome-scores.csv, genome-tags.csv, links.csv, movies.csv, ratings.csv, tags.csv)
- Different use cases
  - All top-rated movies
  - Count all movies
  - Count all movies of a particular genre, e.g. 'Mystery' or 'Action'
  - Get all movies of a particular genre, e.g. 'Mystery' or 'Action'
  - Get movie recommendations based on user choice
- References
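
The counting, genre, and top-rated use cases listed above can be illustrated with a minimal plain-Python sketch. It is not this project's Spark code: it runs on a tiny made-up sample that only mirrors the column layout of movies.csv (movieId, title, genres, with genres separated by `|`) and ratings.csv (userId, movieId, rating, timestamp). The helper names `load_rows`, `movies_of_genre`, and `top_rated` are hypothetical, and "top rated" is assumed here to mean highest average rating. The recommendation use case is not covered.

```python
import csv
import io
from collections import defaultdict

# Made-up sample in the movies.csv layout (movieId,title,genres).
# Genres are pipe-separated, as in the MovieLens files.
MOVIES_CSV = """movieId,title,genres
1,Movie A,Action|Mystery
2,Movie B,Comedy
3,Movie C,Mystery|Thriller
4,Movie D,Action
"""

# Made-up sample in the ratings.csv layout (userId,movieId,rating,timestamp).
RATINGS_CSV = """userId,movieId,rating,timestamp
1,1,4.0,100
2,1,5.0,101
1,2,3.0,102
2,3,4.5,103
3,3,3.5,104
3,4,2.0,105
"""


def load_rows(text):
    """Parse CSV text with a header row into a list of dicts."""
    return list(csv.DictReader(io.StringIO(text)))


def movies_of_genre(movies, genre):
    """Return the movies whose pipe-separated genre list contains `genre`."""
    return [m for m in movies if genre in m["genres"].split("|")]


def top_rated(movies, ratings, n):
    """Return the n (title, average rating) pairs with the highest average."""
    totals = defaultdict(lambda: [0.0, 0])  # movieId -> [rating sum, rating count]
    for r in ratings:
        entry = totals[r["movieId"]]
        entry[0] += float(r["rating"])
        entry[1] += 1
    titles = {m["movieId"]: m["title"] for m in movies}
    averages = [(titles[mid], s / c) for mid, (s, c) in totals.items()]
    return sorted(averages, key=lambda pair: pair[1], reverse=True)[:n]


movies = load_rows(MOVIES_CSV)
ratings = load_rows(RATINGS_CSV)

print(len(movies))                                             # count all movies
print(len(movies_of_genre(movies, "Mystery")))                 # count movies of a genre
print([m["title"] for m in movies_of_genre(movies, "Action")]) # get movies of a genre
print(top_rated(movies, ratings, 2))                           # top-rated movies
```

On the full ml-20m data, the same steps would typically be expressed as Spark filter, group-by, and count operations over the loaded CSV files. A real top-rated query would probably also require a minimum number of ratings per movie, which this sketch omits.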