Exercise for DE (HIVE)

  • For the purpose of the exercise, a 4-node AWS cluster (m3.large, 8 GB memory) was spun up, with Hadoop and Hive configured from scratch.
  • generate_data: Contains the Python scripts that generate the data for the exercise. The generated file is then pushed to HDFS to be used by Hive.
  • solution_1: Contains the Hive solution scripts for question 1.
  • Solution_2_and_3: Contains the Hive solution scripts for questions 2 and 3.
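
The generate-then-push flow described for generate_data can be sketched roughly as follows. This is a minimal illustration, not the actual scripts in the repo: the row schema (id, category, amount), the function names, and any file or HDFS paths are assumptions made for the example. Only the `hdfs dfs -put` command itself is the standard Hadoop CLI.

```python
import csv
import random
import subprocess
from pathlib import Path


def generate_rows(n, seed=0):
    """Yield n rows of a hypothetical (id, category, amount) schema."""
    rng = random.Random(seed)  # seeded so repeated runs produce the same data
    categories = ["a", "b", "c"]  # placeholder values, not from the exercise
    for i in range(n):
        yield (i, rng.choice(categories), round(rng.uniform(1, 100), 2))


def write_csv(path, n, seed=0):
    """Write n generated rows to a local CSV file and return its path."""
    path = Path(path)
    with path.open("w", newline="") as f:
        csv.writer(f).writerows(generate_rows(n, seed))
    return path


def hdfs_put_command(local_path, hdfs_dir):
    """Build the `hdfs dfs -put` command that copies a local file into HDFS."""
    # -f overwrites the destination file if it already exists
    return ["hdfs", "dfs", "-put", "-f", str(local_path), hdfs_dir]


def push_to_hdfs(local_path, hdfs_dir):
    """Run the upload; needs the `hdfs` CLI on PATH and a reachable cluster."""
    subprocess.run(hdfs_put_command(local_path, hdfs_dir), check=True)
```

On a cluster node this might be used as `push_to_hdfs(write_csv("data.csv", 1000), "/some/hdfs/dir")`, where both the file name and the HDFS directory are placeholders. A Hive table can then be pointed at that directory.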