/avenir

Set of machine learning tools based on Hadoop and Storm

Primary LanguageJava

Introduction

Set of predictive and exploratory data mining tools. Runs on Hadoop and Storm

Philosophy

  • Simple to use
  • Input output in CSV format
  • Metadata defined in simple JSON file
  • Extremely configurable with tons of configuration knobs

Solution

  • Exploratry analytic including correlation, feature subset selection
  • Naive Bayes
  • Discrimininant analysis
  • Nearest neighbor
  • Decision tree
  • Reinforcement learning

Blogs

The following blogs of mine are good source of details of sifarish. These are the only source of detail documentation

Getting started

Project's resource directory has various tutorial documents for the use cases described in the blogs.

Configuration

All configuration parameters are described in the wiki page https://github.com/pranab/avenir/wiki/Configuration

Help

Please feel free to email me at pkghosh99@gmail.com