/Kmeans-using-MapReduce

K-means clustering algorithm using MapReduce.

Primary LanguagePython

Kmeans-using-MapReduce

Building a scalable K-Means clustering system with MapReduce paradigm and gRPC communication. This project showcases the seamless coordination between masters, mappers, and reducers to enable efficient data analysis, fostering cluster discovery and insights.

Tech Stack

Python Protobuf  gRPC

Installation

  1. Clone the repo
  git clone https://github.com/aryanGupta-09/MapReduce.git
  1. Go to the project directory
  cd MapReduce
  1. Generate the Python code for gRPC
  python -m grpc_tools.protoc -I. --python_out=. --grpc_python_out=. kmeans.proto
  1. Update Input/points.txt

  2. Run the Python files