/playground_sparklyr

Testing the new sparklyr package from RStudio

Playground: sparklyr

  • In this repo I aim to play around with the new RStudio packake sparklyr.

  • Check out the capabilities of the R interface for Apache Spark, but mainly this is for myself messing with this peace of technology.

  • Hopefully, I will document enough that it will make it easier to my future self in case I need to use this toolbox.

Ideas

  • [ X ] Start with the demo localhost
  • [   ] Find worthy open dataset to demo
  • [   ] Config connection to the Google Cloud Spark cluster
  • [   ] Do demo on remote
  • [   ] Example cluster computation with open datset