This project aims to provide a ready-to-use Apache Spark cluster solution for study and development purposes. For production environment the user must configure properly.
This repository contains:
- Base image with ubuntu 14.04, Java Oracle 8, Apache Spark 2.3 with Hadoop 2.7 support
- Spark Master Node
- Spark Workers
- Sample Applications in Python 3 and Scala 2.11
Maintainer: Eduardo Le Masson