/docker-spark-cluster

This project aims to provide a ready-to-use Apache Spark cluster solution for study and development purposes

Primary LanguageShell

Docker-Spark-Cluster

This project aims to provide a ready-to-use Apache Spark cluster solution for study and development purposes. For production environment the user must configure properly.

This repository contains:

  • Base image with ubuntu 14.04, Java Oracle 8, Apache Spark 2.3 with Hadoop 2.7 support
  • Spark Master Node
  • Spark Workers
  • Sample Applications in Python 3 and Scala 2.11

Maintainer: Eduardo Le Masson