Python implementation of k-means clustering algorithm in MapReduce.
- Hadoop Installation
- Dataset Creation
- createDataset.py
- Plot of data points
- K-means Clustering Algorithm
- Instructions for running k-means in Cloudera
- run.sh & reader.py
- run.sh
- reader.py
- MapReduce
- mapper.py
- reducer.py
- Plot Representation