bigdata_fundamentals_S4
This repository collects my practical activities (awesome code) from my class at ENSET-M, S4 II-BDCC 2.
Topics
- Interacting with HDFS using the command line. Source Code, Report.
- Interacting with HDFS using Java. Source Code, Report.
- MapReduce job to find the total sales by city. Source Code, Report.
- MapReduce job to find the total sales by city on a specific date. Source Code, Report.
- MapReduce job to find the min/max salary by department. Source Code, Report.
- MapReduce job to find the number of employees by department. Source Code, Report.
- MapReduce job to find the min/max temperature by month (1916). Source Code, Report.
- MapReduce job to find the min/max temperature by year (1908-1916, multiple input files). [Not Ready]
- K-means implementation with Java. Source Code.
- K-means implementation using MapReduce (points clustering). Source Code, TP, Report.
- K-means implementation using MapReduce (image processing). Source Code, TP[Report]
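
For readers new to the pattern, the "total sales by city" job above can be sketched in plain Java, without any Hadoop classes. This is only an illustration of the map, shuffle and reduce steps, not the code from this repository. The class name `SalesByCitySketch`, the method names and the input layout (`date city product price`, separated by whitespace) are all assumptions made for the example.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java sketch of the map -> shuffle -> reduce flow; no Hadoop classes are used.
// All names here are hypothetical and not taken from the repository.
class SalesByCitySketch {

    // "Map" step: parse one line and emit a (city, price) pair.
    // Assumed line layout: date city product price
    static Map.Entry<String, Double> map(String line) {
        String[] fields = line.trim().split("\\s+");
        return Map.entry(fields[1], Double.parseDouble(fields[3]));
    }

    // "Shuffle + reduce" step: group the pairs by city and sum their prices.
    static Map<String, Double> totalByCity(List<String> lines) {
        Map<String, Double> totals = new TreeMap<>();
        for (String line : lines) {
            if (line.isBlank()) {
                continue; // skip empty lines
            }
            Map.Entry<String, Double> pair = map(line);
            totals.merge(pair.getKey(), pair.getValue(), Double::sum);
        }
        return totals;
    }

    public static void main(String[] args) {
        List<String> lines = List.of(
            "2023-01-01 Casablanca Laptop 100.0",
            "2023-01-01 Rabat Phone 40.0",
            "2023-01-02 Casablanca Mouse 50.0"
        );
        // Print one "city<TAB>total" line per city, similar to a reducer's output.
        totalByCity(lines).forEach((city, total) -> System.out.println(city + "\t" + total));
    }
}
```

In a real Hadoop job the `map` logic would live in a `Mapper` and the summing in a `Reducer`, with the framework doing the grouping by key. The "on a specific date" variant would presumably just filter lines on the date field before emitting a pair.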
Spark
- Three Spark tasks (WordCount, total sales by city, and total sales by date and city). Source Code, TP.
- Analyzing the meteorological data provided by NCEI (National Centers for Environmental Information) using Spark. Source Code, TP.
- Introductory Spark SQL TP (DataFrames and Datasets using Java). Source Code, DETAILS.