Spark Benchmarks Installer & Manager

Summary

This project aims to aid in setting up a cluster via OpenNebula and run benchmarks for Spark.

Prerequisites

  • an OpenNebula-managed cluster;
  • a contextualized virtual machine running Ubuntu 16.04 and with SSH installed.

Description

The file scripts/init.sh is the startup script of your VMs. It does the following:

A script is included which throttles bandwidths between VMs according to a given distribution and interval at which to generate new values.

Usage

  • Instantiate a VM to use as master
  • Set appropriate framework configfurations in the config_files folder
  • Verify scripts/conf/defaults.py settings
  • Run spark_deploy -m <master_ip> to spawn and configure the slave nodes
  • Start up the frameworks of your liking
  • Run benchmarks
  • Graduate :)