
Database course project

Primary language: Python

hadoop_mapReduce

Environment: CentOS running in a virtual machine. The data come from Cloudera.

  • Problem: Give a sales breakdown by product category across all stores (P1_mapper & P1_reducer)
  • Problem: Find the highest-value single sale in each store (P2_mapper & P2_reducer)
  • Problem: Find the total number of sales and the total sales value across all stores (P3_mapper & P3_reducer)
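
The three problems share one map/shuffle/reduce shape, which can be sketched locally without a cluster. This is a minimal illustration, not the code in the P1 to P3 scripts. It assumes each input line has six tab-separated fields (date, time, store, product category, sale amount, payment method); the repository does not state that layout. All function names here are hypothetical.

```python
from itertools import groupby


def parse(line):
    """Return (store, category, amount), or None for a malformed line.

    Assumed layout: date, time, store, category, amount, payment (tab-separated).
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 6:
        return None
    _date, _time, store, category, amount, _payment = fields
    try:
        return store, category, float(amount)
    except ValueError:
        return None


def map_by_category(lines):
    """Problem 1 mapper: emit (category, amount)."""
    for line in lines:
        rec = parse(line)
        if rec:
            yield rec[1], rec[2]


def map_by_store(lines):
    """Problem 2 mapper: emit (store, amount)."""
    for line in lines:
        rec = parse(line)
        if rec:
            yield rec[0], rec[2]


def map_all(lines):
    """Problem 3 mapper: emit every amount under one shared key."""
    for line in lines:
        rec = parse(line)
        if rec:
            yield "all", rec[2]


def count_and_sum(values):
    """Problem 3 reducer logic: number of sales and their total value."""
    count, total = 0, 0.0
    for value in values:
        count += 1
        total += value
    return count, total


def run(mapper, combine, lines):
    """Map, sort by key (standing in for Hadoop's shuffle), then reduce."""
    shuffled = sorted(mapper(lines), key=lambda kv: kv[0])
    return {
        key: combine(value for _, value in group)
        for key, group in groupby(shuffled, key=lambda kv: kv[0])
    }
```

Under these assumptions, `run(map_by_category, sum, lines)` gives the per-category totals, `run(map_by_store, max, lines)` gives the largest single sale per store, and `run(map_all, count_and_sum, lines)` gives the overall count and total. In a real Hadoop Streaming job the mapper and reducer would instead read from standard input and print tab-separated key/value lines.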