
Google Cloud DataFlow example with movie ratings

Primary language: Java

Google Cloud DataFlow example

We use two simple tasks to show how to process data with Google Cloud DataFlow. Our solution extracts basic statistics from the data, which can serve as input to a statistical program for in-depth analysis.

See the resources/datasets folder for more information about the datasets.

Task 1: Movie ratings

  1. How many movies does each user rate?
  2. Is the movie app used more by females or males?
  3. Which gender watches more movies, males or females?
  4. Perhaps males and females rate movies differently. Is there a difference in ratings between genders? Which gender gives higher ratings, and is the difference significant?
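
The questions above come down to per-key counts and per-key means. The following is a minimal plain-Java sketch of those aggregations, not the pipeline code from this repository. The `Rating` class, its field names, and the "F"/"M" gender codes are assumptions made for illustration; the real record layout is described in resources/datasets. Ratings per gender is used here as a rough proxy for "watches more movies".

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical record layout; the actual fields are described in resources/datasets.
final class Rating {
    final int userId;
    final String gender; // assumed to be "F" or "M"
    final int movieId;
    final double stars;

    Rating(int userId, String gender, int movieId, double stars) {
        this.userId = userId;
        this.gender = gender;
        this.movieId = movieId;
        this.stars = stars;
    }
}

final class RatingStats {
    // Question 1: number of ratings submitted by each user.
    static Map<Integer, Long> ratingsPerUser(List<Rating> ratings) {
        Map<Integer, Long> counts = new TreeMap<>();
        for (Rating r : ratings) {
            counts.merge(r.userId, 1L, Long::sum);
        }
        return counts;
    }

    // Question 3: number of ratings submitted by each gender.
    static Map<String, Long> ratingsPerGender(List<Rating> ratings) {
        Map<String, Long> counts = new TreeMap<>();
        for (Rating r : ratings) {
            counts.merge(r.gender, 1L, Long::sum);
        }
        return counts;
    }

    // Question 4: mean rating given by each gender.
    static Map<String, Double> meanRatingPerGender(List<Rating> ratings) {
        Map<String, Double> sums = new TreeMap<>();
        for (Rating r : ratings) {
            sums.merge(r.gender, r.stars, Double::sum);
        }
        Map<String, Long> counts = ratingsPerGender(ratings);
        Map<String, Double> means = new TreeMap<>();
        for (Map.Entry<String, Double> e : sums.entrySet()) {
            means.put(e.getKey(), e.getValue() / counts.get(e.getKey()));
        }
        return means;
    }

    public static void main(String[] args) {
        // Made-up sample rows, only to exercise the methods.
        List<Rating> sample = Arrays.asList(
            new Rating(1, "F", 10, 4.0),
            new Rating(1, "F", 11, 5.0),
            new Rating(2, "M", 10, 3.0));
        System.out.println(ratingsPerUser(sample));
        System.out.println(ratingsPerGender(sample));
        System.out.println(meanRatingPerGender(sample));
    }
}
```

Whether a difference between the per-gender means is significant is not decided here; the counts and means are the kind of basic statistics that would be handed to a statistical program for that test.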

Task 2: Button display time

  1. How can the display duration of the button shown in the app be optimized?