bigdataPractice

Big data practice on Hadoop and Spark.

Three practices:

  1. Matrix multiplication using Hadoop MapReduce
  2. Linear regression with extended features and feature selection using Spark (Scala)
  3. Breadth-first search on a graph
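
As an illustration of the first practice, here is a minimal sketch of the common one-pass MapReduce formulation of the product C = A x B, simulated in plain Java. It is not the code from this repository and uses no Hadoop classes; the class name `MatrixMultiplySketch`, the `Tagged` helper, and the string key format `"i,k"` are all assumptions made for the example. It also assumes non-empty matrices with matching inner dimensions.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Plain-Java simulation of a one-pass MapReduce matrix product (hypothetical names,
// no Hadoop dependency). Assumes a is m x n, b is n x p, and both are non-empty.
final class MatrixMultiplySketch {

    // A value sent to a reducer: source matrix ('A' or 'B'), shared index j, cell value.
    static final class Tagged {
        final char matrix;
        final int j;
        final double value;

        Tagged(char matrix, int j, double value) {
            this.matrix = matrix;
            this.j = j;
            this.value = value;
        }
    }

    // Map phase: send each cell to every output cell (i, k) it contributes to.
    static Map<String, List<Tagged>> mapPhase(double[][] a, double[][] b) {
        int m = a.length, n = b.length, p = b[0].length;
        Map<String, List<Tagged>> groups = new HashMap<>();
        for (int i = 0; i < m; i++) {
            for (int k = 0; k < p; k++) {
                List<Tagged> values = groups.computeIfAbsent(i + "," + k, x -> new ArrayList<>());
                for (int j = 0; j < n; j++) {
                    values.add(new Tagged('A', j, a[i][j])); // A[i][j] is needed by every C[i][k]
                    values.add(new Tagged('B', j, b[j][k])); // B[j][k] is needed by every C[i][k]
                }
            }
        }
        return groups;
    }

    // Reduce phase for one key (i, k): pair A and B values on j and sum the products.
    static double reduce(List<Tagged> values, int n) {
        double[] fromA = new double[n];
        double[] fromB = new double[n];
        for (Tagged t : values) {
            if (t.matrix == 'A') {
                fromA[t.j] = t.value;
            } else {
                fromB[t.j] = t.value;
            }
        }
        double sum = 0.0;
        for (int j = 0; j < n; j++) {
            sum += fromA[j] * fromB[j];
        }
        return sum;
    }

    // Runs the map phase, then reduces every key into the result matrix.
    static double[][] multiply(double[][] a, double[][] b) {
        int m = a.length, n = b.length, p = b[0].length;
        Map<String, List<Tagged>> groups = mapPhase(a, b);
        double[][] c = new double[m][p];
        for (int i = 0; i < m; i++) {
            for (int k = 0; k < p; k++) {
                c[i][k] = reduce(groups.get(i + "," + k), n);
            }
        }
        return c;
    }

    public static void main(String[] args) {
        double[][] a = {{1, 2}, {3, 4}};
        double[][] b = {{5, 6}, {7, 8}};
        // prints [[19.0, 22.0], [43.0, 50.0]]
        System.out.println(Arrays.deepToString(multiply(a, b)));
    }
}
```

In a real Hadoop job the map and reduce steps would live in separate `Mapper` and `Reducer` classes and the grouping by key would be done by the framework's shuffle; the in-memory `HashMap` here only stands in for that step.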
Languages used:
Java (Hadoop) / Scala (Spark)
Based on theory:
1. Linear regression
2. Matrix multiplication
3. Breadth-first search on a graph
4. Hadoop
5. Spark
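
For the breadth-first search item, a minimal single-machine sketch in plain Java may help fix the idea. It is not the repository's distributed implementation and does not use the Giraph API; the class name `BfsSketch`, the adjacency-list representation, and the integer vertex ids are assumptions made for the example.

```java
import java.util.ArrayDeque;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Queue;
import java.util.TreeMap;

// Single-machine breadth-first search over an adjacency list (hypothetical names).
final class BfsSketch {

    // Returns the hop count from source to every vertex reachable from it.
    static Map<Integer, Integer> distances(Map<Integer, List<Integer>> graph, int source) {
        Map<Integer, Integer> dist = new HashMap<>();
        Queue<Integer> frontier = new ArrayDeque<>();
        dist.put(source, 0);
        frontier.add(source);
        while (!frontier.isEmpty()) {
            int u = frontier.remove();
            // Vertices without an adjacency entry are treated as having no outgoing edges.
            for (int v : graph.getOrDefault(u, List.of())) {
                if (!dist.containsKey(v)) {       // first visit is the shortest path in hops
                    dist.put(v, dist.get(u) + 1);
                    frontier.add(v);
                }
            }
        }
        return dist;
    }

    public static void main(String[] args) {
        Map<Integer, List<Integer>> graph = Map.of(
                1, List.of(2, 3),
                2, List.of(4),
                3, List.of(4),
                4, List.of());
        // TreeMap only sorts the keys for printing; prints {1=0, 2=1, 3=1, 4=2}
        System.out.println(new TreeMap<>(distances(graph, 1)));
    }
}
```

A vertex-centric framework expresses the same idea differently: each vertex keeps its current distance and passes distance + 1 to its neighbours in every round, so the queue above is replaced by messages between vertices.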
Remark:
Please add the related library .jar files:
1. hadoop-core
2. spark-assembly
3. hadoop-core and giraph-core

Test data is attached.

Supervised by:
Dr. Eric Lo, Hong Kong PolyU