spark-movie-lens

Various examples of analytics using Apache Spark

Primary language: Scala

Apache Spark Analytics Examples

MOVIE RECOMMENDATION BASED ON THE MOVIELENS DATASET

  • Dataset: ml-20m
    • Data size (Ratings: 20000263, Tags: 465564, Movies: 27278, Users: 138493)
    • Data files (genome-scores.csv, genome-tags.csv, links.csv, movies.csv, ratings.csv, tags.csv)
  • Different use cases
    • List all top-rated movies
    • Count all movies
    • Count all movies of a particular genre, e.g. 'Mystery' or 'Action'
    • Get all movies of a particular genre, e.g. 'Mystery' or 'Action'
    • Get movie recommendations based on user choice
  • References
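
The use cases listed above could be sketched roughly as follows with the Spark DataFrame API. This is a minimal illustration, not the code in this repository. The object name `MovieLensSketch`, the function names, the `minRatings` threshold, and the `ml-20m/...` file paths are hypothetical. ALS collaborative filtering is used for the recommendation step as an assumption, since the technique is not stated here. The sketch relies on the standard ml-20m column names (`movieId`, `title`, `genres` in movies.csv and `userId`, `movieId`, `rating` in ratings.csv) and on `genres` being pipe-separated.

```scala
import org.apache.spark.ml.recommendation.ALS
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.{array_contains, avg, col, count, desc, split}

// Hypothetical helper object; all names here are illustrative only.
object MovieLensSketch {

  // Count all movies.
  def countMovies(movies: DataFrame): Long = movies.count()

  // Get all movies whose pipe-separated `genres` column lists the given genre.
  // Counting movies of a genre is then moviesOfGenre(movies, genre).count().
  def moviesOfGenre(movies: DataFrame, genre: String): DataFrame =
    movies.filter(array_contains(split(col("genres"), "\\|"), genre))

  // Top-rated movies: average rating per movie, keeping only movies with at
  // least `minRatings` ratings (an assumed cutoff so that a single 5-star
  // rating does not dominate the ranking).
  def topRated(movies: DataFrame, ratings: DataFrame, minRatings: Long, n: Int): DataFrame =
    ratings
      .groupBy("movieId")
      .agg(avg("rating").as("avgRating"), count("rating").as("numRatings"))
      .filter(col("numRatings") >= minRatings)
      .join(movies, "movieId")
      .orderBy(desc("avgRating"), desc("numRatings"))
      .limit(n)

  // Recommendations for one user via ALS (an assumed technique, see above).
  def recommend(ratings: DataFrame, userId: Int, n: Int): DataFrame = {
    val als = new ALS()
      .setUserCol("userId")
      .setItemCol("movieId")
      .setRatingCol("rating")
      .setColdStartStrategy("drop")
    val model = als.fit(ratings)
    val user = ratings.select("userId").where(col("userId") === userId).distinct()
    model.recommendForUserSubset(user, n)
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("MovieLensSketch")
      .master("local[*]")
      .getOrCreate()

    def readCsv(path: String): DataFrame =
      spark.read.option("header", "true").option("inferSchema", "true").csv(path)

    // Assumed local paths to the unpacked ml-20m data files.
    val movies = readCsv("ml-20m/movies.csv")
    val ratings = readCsv("ml-20m/ratings.csv")

    println(s"All movies: ${countMovies(movies)}")
    println(s"Mystery movies: ${moviesOfGenre(movies, "Mystery").count()}")
    moviesOfGenre(movies, "Action").show(10, truncate = false)
    topRated(movies, ratings, minRatings = 1000L, n = 10).show(truncate = false)
    recommend(ratings, userId = 1, n = 5).show(truncate = false)

    spark.stop()
  }
}
```

Splitting `genres` on the pipe character before matching avoids accidental substring hits. Fitting ALS on all 20 million ratings can take a while on a single machine, so sampling `ratings` first may be worthwhile when experimenting.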