In this repo I aim to play around with the new RStudio package sparklyr.

The goal is to check out the capabilities of the R interface for Apache Spark, but mainly this is a place for me to experiment with this piece of technology.

Hopefully, I will document enough to make things easier for my future self in case I need to use this toolbox again.
- [x] Start with the demo on localhost
- [ ] Find a worthy open dataset to demo
- [ ] Configure the connection to the Google Cloud Spark cluster
- [ ] Run the demo on the remote cluster
- [ ] Example cluster computation with an open dataset