Slides and samples used in Distributed Computing with Spark talk.
First example is a simple snippet used for guess the most retweeted tweet of a bunch of them. It also explore some options at deploying embeded Spark cluster and some basic features.
Same example as before, but using SparkSQL syntax...
sbt run