Introduction-to-Hadoop

Hadoop is an open source software that utilized the network of many computers to solve problem involving massive amounts of data and computation. It is a framework for distributed storage and processing of big data using the MapReduce model.