MaheedharGunturu's Stars
deanhiller/databus
time series data in cassandra with visualization(NREL's opensource databus project)
facebookarchive/liblogfaf
A library that logs messages using non-blocking UDP datagrams.
kairosdb/kairosdb
Fast scalable time series database
hmsonline/cassandra-triggers
klbostee/dumbo
Python module that allows one to easily write and run Hadoop programs.
amplab/MLI
An API for Distributed Machine Learning
madlib/archived_madlib
MADlib has moved to Apache MADlib (incubating). Please send pull requests to the Apache repository.
amplab/graphx
Former GraphX development repository. GraphX has been merged into Apache Spark; please submit pull requests there.
vmware-archive/pymadlib
A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms
jpatanooga/Metronome
Suite of parallel iterative algorithms built on top of Iterative Reduce
joaquincasares/heap-calculator
Easily calculates and explores common Apache Cassandra heap pressure issues.
apache/oozie
Mirror of Apache Oozie
Netflix/genie
Distributed Big Data Orchestration Service
spotify/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
TheClimateCorporation/lemur
Lemur is a tool to launch hadoop jobs locally or on EMR, based on a configuration file, referred to as a jobdef. The jobdef file describes your EMR cluster, local environment, pre- and post-actions and zero or more "steps".
h2oai/h2o-2
Please visit https://github.com/h2oai/h2o-3 for latest H2O
muricoca/crab
Crab is a flexible, fast recommender engine for Python that integrates classic information filtering recommendation algorithms in the world of scientific Python packages (numpy, scipy, matplotlib).
apache/predictionio
PredictionIO, a machine learning server for developers and ML engineers.