/bigmining

Distributed Data Mining and Machine Learning algorithms on top of Apache Hadoop, Apache Giraph, Apache Hama

Primary LanguageJava

Computing frameworks

  • Apache Hadoop Mapreduce (Well known implementation of map-reduce)
  • Apache Hama (General BSP-based distributed computing framework)
  • Apache Giraph (Distributed graph computing framework)
  • Apache Spark (In-memory distributed computing framework)
  • Apache Storm (Real time distributed computing framework)

Implemented Algorithms

Hadoop Mapreduce

Ridge Regression trained by coordinate descent

Lasso Regression trained by gradient descent