Exercise for DE (HIVE)
- For the purpose of the exercise, a 4-node AWS cluster (m3.large, 8GB memory) was spun up, with Hadoop and Hive configured from scratch.
- generate_data: Has Python scripts that generate the data for the exercise. The generated file is then pushed to HDFS to be used by Hive.
- solution_1: Has Hive solution scripts for question 1
- Solution_2_and_3: Has Hive solution scripts for questions 2 and 3
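
The generate-then-push flow described for generate_data could look roughly like the sketch below. It is not the actual script from this repository: the column layout (id, user, amount), the file name, and the HDFS directory are all hypothetical, chosen only for illustration. The only external tool assumed is the standard `hdfs dfs` command line client being on the PATH of a cluster node.

```python
import csv
import random
import subprocess


def generate_rows(n_rows, seed=0):
    """Yield n_rows of made-up (id, user, amount) tuples.

    The schema is hypothetical; the real exercise data may differ.
    A fixed seed keeps the output reproducible between runs.
    """
    rng = random.Random(seed)
    for i in range(1, n_rows + 1):
        user = f"user_{rng.randint(1, 100)}"
        amount = round(rng.uniform(1.0, 500.0), 2)
        yield (i, user, amount)


def write_csv(path, n_rows, seed=0):
    """Write the generated rows to a local comma-separated file."""
    with open(path, "w", newline="") as f:
        csv.writer(f).writerows(generate_rows(n_rows, seed))
    return path


def push_to_hdfs(local_path, hdfs_dir):
    """Copy a local file into HDFS using the `hdfs dfs` CLI.

    Assumes the Hadoop client is installed and configured on this machine.
    """
    # Create the target directory if it does not exist yet.
    subprocess.run(["hdfs", "dfs", "-mkdir", "-p", hdfs_dir], check=True)
    # -f overwrites an existing file of the same name.
    subprocess.run(["hdfs", "dfs", "-put", "-f", local_path, hdfs_dir], check=True)
```

On a cluster node, a call such as `push_to_hdfs(write_csv("sample_data.csv", 1000), "/user/hive/exercise_data")` would generate the file and upload it; both the file name and the HDFS path here are placeholders. A Hive external table can then be pointed at that directory.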