big data practice on hadoop and spark
3 Practices:
- Matrix multiplication using hadoop mapreduce
- Linear regression with extended features and feature selection using spark scala
- breath first search for graph
1. linear regression
2. matrix multiplication
3. Breath first search for graph
4. Hadoop
5. Spark
: please add related lib .jar for:
1. hadoop-corebr
2. spark-assembly
3. hadoop-core and giraph-core
testing data attached
Dr.Eric Lo, Hong Kong PolyU