sparkbook This repo contains the source code from the chapters in the book titled Spark: Big Data Cluster Computing in Production.