Pinned Repositories
spark
Apache Spark - A unified analytics engine for large-scale data processing
RSP-QL
A home of RSP-QL syntax and semantics discussion
GitLab
The first lab should get you used to getting code from GitLab, creating your own repository, and submitting code.
Lab1_ConfigurationVagrant
Installing and running your VM - Vagrant or Cloudera QuickStart VM
Lab4_HivePigMovielens
For this lab, we will use the MovieLens Small Dataset to examine the functions of Hive and Pig.
EMR-on-AWS
Useful resources for setting up EMR on AWS while remaining on the free tier
Lab2_HadoopMR
This is the second lab: writing and running a basic Hadoop program.
Lab3_HivePig
For this lab, we will install Hive and Pig and start practicing with their basic functions separately.
Lab5_Storm
This lab asks you to set up a simple Storm topology.
Big-Data-Cloud
amileo's Repositories
amileo/spark
Apache Spark - A unified analytics engine for large-scale data processing