bme-map-reduce
Project Overview
Welcome to short notebook that introduce the map/reduce programming paradigm. You will learn how to build the next stage of data exploration pipeline. The map/reduce construction can be implemented as a simple script that runs on local machine, bat the idea stands in the bigger multinode cluster.
Project Instructions
-
(Optional step) Fork the repository to your githab account. It allows you to upload the changes to your own github
-
Clean all the previous copies of this repo in your local home directory (it could be downloaded by other students).
cd ls -all rm -rf bme-map-reduce
-
Clone the repository to your local PC
Go to your home directory, and clone this repository. In case you made a copy (fork) provide your own URL
cd # git clone https://github.com/tstokrk/bme-data-exp.git git clone https://github.com/YOUR_GITHUB_LOGIN/bme-data-exp.git cd bme-data-exp
-
Open the Jupyter Notebook and follow the instructions
jupyter notebook map-reduce.ipynb
-
(Optional step) Commit and push all the changes to your own github repo
git commit -m "My update.." git push origin master