NOSQL_LAB1

Question1 Setup a single node cluster Hadoop.

Question2 Input file consists of NCDC weather data and the output file gives the maximum temperature of two years.

Question3 Input file consists of web access log produced by a web server and the Map Reduce program called "ImageCounter" counts the number of times GIF, JPG, and other image files that have been accessed by clients. The Map reduce outputs three figures number of GIF,number of JPEG and a number of other images.

Question4 With the same input file as above, the Map Reduce program outputs the total number of requests and the total download size (in mega bytes) on monthly basis.

Question5 With the same input file as above, the Map Reduce program lists Timestamp, URL for which http response status has been 404.

The lab content is provided by Prof. PM Jat.