README

Purpose

This repo contains resources that summarize certain Spark concepts or help perform certain operations with Spark. The scripts in this repo are meant to make Spark debugging as easy as possible.

ExecutorSizing

This code is a simple helper for sizing your Spark executors correctly within given limits. It is based on this wonderful article.
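To make the idea concrete, here is a minimal sketch of such a sizing calculation. It is not the helper in this folder: the function name `size_executors`, its parameters, and the heuristics (5 cores per executor, 1 core and 1 GB reserved per node, one executor slot left for the YARN application master, 10% memory overhead with a 384 MB floor) are assumptions chosen for illustration.

```python
import math


def size_executors(nodes, cores_per_node, mem_per_node_gb,
                   cores_per_executor=5, reserved_cores=1,
                   reserved_mem_gb=1, overhead_fraction=0.10,
                   min_overhead_gb=0.384):
    """Hypothetical sizing heuristic; all defaults are assumptions."""
    # Leave some cores on each node for the OS and Hadoop daemons.
    usable_cores = cores_per_node - reserved_cores
    executors_per_node = usable_cores // cores_per_executor
    if executors_per_node < 1:
        raise ValueError("not enough cores per node for one executor")

    # Leave one executor slot in the cluster for the YARN application master.
    num_executors = nodes * executors_per_node - 1
    if num_executors < 1:
        raise ValueError("cluster too small for the chosen executor size")

    # Each YARN container must hold the executor heap plus its overhead,
    # where overhead = max(min_overhead_gb, overhead_fraction * heap).
    container_gb = (mem_per_node_gb - reserved_mem_gb) / executors_per_node
    heap_gb = container_gb / (1 + overhead_fraction)
    if heap_gb * overhead_fraction < min_overhead_gb:
        heap_gb = container_gb - min_overhead_gb

    return {
        "num_executors": num_executors,
        "executor_cores": cores_per_executor,
        "executor_memory_gb": math.floor(heap_gb),
    }


if __name__ == "__main__":
    # Example cluster: 6 nodes, 16 cores and 64 GB each (made-up numbers).
    print(size_executors(nodes=6, cores_per_node=16, mem_per_node_gb=64))
```

The resulting values would map onto `--num-executors`, `--executor-cores` and `--executor-memory` when submitting a job.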

Ipython

Gives some guidance on how to set up an IPython notebook with Spark and YARN.
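As a rough illustration of what such a setup can involve, the sketch below builds the environment and command line for starting `pyspark` with IPython notebook as the driver front end. It is not taken from this folder. The function name `pyspark_notebook_launch`, the example paths, and the `yarn-client` master string (the Spark 1.x form) are assumptions; the environment variables `PYSPARK_DRIVER_PYTHON`, `PYSPARK_DRIVER_PYTHON_OPTS` and `HADOOP_CONF_DIR` are the ones Spark itself reads.

```python
import os


def pyspark_notebook_launch(spark_home, hadoop_conf_dir, master="yarn-client"):
    """Return (env, argv) for launching pyspark inside an IPython notebook.

    Hypothetical helper: it only assembles the values, it starts nothing.
    """
    env = dict(os.environ)
    # Tell pyspark to use IPython as the driver and to open the notebook UI.
    env["PYSPARK_DRIVER_PYTHON"] = "ipython"
    env["PYSPARK_DRIVER_PYTHON_OPTS"] = "notebook"
    # Spark locates the YARN ResourceManager through the Hadoop config dir.
    env["HADOOP_CONF_DIR"] = hadoop_conf_dir

    argv = [os.path.join(spark_home, "bin", "pyspark"), "--master", master]
    return env, argv


if __name__ == "__main__":
    # Example paths are placeholders; adjust them to your installation.
    env, argv = pyspark_notebook_launch("/opt/spark", "/etc/hadoop/conf")
    print(" ".join(argv))
```

The returned pair could then be passed to `subprocess.call(argv, env=env)` to start the notebook server.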

MemoryTesting

When running Spark on YARN, I repeatedly saw Spark executors die while performing certain computations. The likely cause is Java 8 requesting too much virtual memory; this folder contains my more thorough debugging of the issue.
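If the executors are indeed being killed by YARN's virtual memory check (an assumption, since the cause above is only the likely one), the limit involved can be estimated as below. The NodeManager allows a container `yarn.nodemanager.vmem-pmem-ratio` times its physical memory in virtual memory (default 2.1) when `yarn.nodemanager.vmem-check-enabled` is true. The function names and the example numbers are hypothetical.

```python
def vmem_limit_mb(container_mb, vmem_pmem_ratio=2.1):
    """Virtual memory YARN allows a container of the given physical size."""
    return container_mb * vmem_pmem_ratio


def exceeds_vmem_limit(vmem_used_mb, container_mb, vmem_pmem_ratio=2.1):
    """True if the container would fail the NodeManager's vmem check."""
    return vmem_used_mb > vmem_limit_mb(container_mb, vmem_pmem_ratio)


if __name__ == "__main__":
    # Made-up example: a 4096 MB container whose JVM reserved 9000 MB
    # of virtual memory.
    print(vmem_limit_mb(4096))
    print(exceeds_vmem_limit(9000, 4096))
```

Comparing this limit against the virtual memory figure in the NodeManager's kill message is one way to confirm or rule out this cause.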