/bme-map-reduce

An intro notebook to map/reduce programming paradigm

Primary LanguageHTML

bme-map-reduce

Project Overview

Welcome to short notebook that introduce the map/reduce programming paradigm. You will learn how to build the next stage of data exploration pipeline. The map/reduce construction can be implemented as a simple script that runs on local machine, bat the idea stands in the bigger multinode cluster.

Project Instructions

  1. (Optional step) Fork the repository to your githab account. It allows you to upload the changes to your own github

  2. Clean all the previous copies of this repo in your local home directory (it could be downloaded by other students).

    cd
    ls -all
    rm -rf bme-map-reduce
    
  3. Clone the repository to your local PC

    Go to your home directory, and clone this repository. In case you made a copy (fork) provide your own URL

    cd
    # git clone https://github.com/tstokrk/bme-data-exp.git
    git clone https://github.com/YOUR_GITHUB_LOGIN/bme-data-exp.git 
    cd bme-data-exp
    
  4. Open the Jupyter Notebook and follow the instructions

    jupyter notebook map-reduce.ipynb
    
  5. (Optional step) Commit and push all the changes to your own github repo

    git commit -m "My update.."
    git push origin master