/hadoop-on-colab

An example demonstrating setting up a Hadoop / Spark environment on Google Colab

Primary LanguageJupyter NotebookMIT LicenseMIT

Hadoop on Colab

This repository contains how to install Apache Hadoop 3.2.3, configure it and run it on Google Colaboratory The notebook has the following sections:

  1. Installing Java 8
  2. Installing Secure Shell Server (SSHD)
  3. Installing Hadoop 3.2.3
  4. Running Hadoop in Standalone mode
  5. Running Hadoop in Pseudo-distributed mode