/spark-benchmark

Data mining benchmark using Spark

Primary LanguagePython

spark-benchmark

Data mining benchmark using Spark

This benchmark uses:

  • Frequency Term algorithm (raw and spark-only)
  • Word2vec (Google)
  • Naive Bayes (Classifier)
  • K-Means (Clustering)