MaheedharGunturu's Repositories
MaheedharGunturu/cassandra-triggers
MaheedharGunturu/cassandra-unit
Utility tool for loading data into Cassandra, to help you write well-isolated JUnit tests for your application
MaheedharGunturu/crab
Crab is a flexible, fast recommender engine for Python that integrates classic information filtering recommendation algorithms in the world of scientific Python packages (numpy, scipy, matplotlib).
MaheedharGunturu/databus
Time series data in Cassandra with visualization (NREL's open-source databus project)
MaheedharGunturu/dumbo
Python module that allows one to easily write and run Hadoop programs.
MaheedharGunturu/genie
Hadoop Platform as a Service
MaheedharGunturu/graphx
GraphX development repository (which will eventually be merged into Apache Spark)
MaheedharGunturu/h2o
h2o = fast statistical, machine learning & math runtime for big data
MaheedharGunturu/heap-calculator
Easily calculates and explores common Apache Cassandra heap pressure issues.
MaheedharGunturu/hello-samza
Example project using Samza.
MaheedharGunturu/incubator-samza
Mirror of Apache Samza
MaheedharGunturu/kafka-s3-consumer
Archive Kafka topics to S3, with ZooKeeper support
MaheedharGunturu/kairosdb
Fast scalable time series database
MaheedharGunturu/Kibana
A log-analysis web interface for Logstash and Elasticsearch. More info at http://www.kibana.org
MaheedharGunturu/lemur
Lemur is a tool to launch Hadoop jobs locally or on EMR, based on a configuration file, referred to as a jobdef. The jobdef file describes your EMR cluster, local environment, pre- and post-actions, and zero or more "steps".
MaheedharGunturu/liblogfaf
Making syslog() not block
MaheedharGunturu/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, etc. It also comes with Hadoop support built in.
MaheedharGunturu/madlib
Open-source library for scalable in-database analytics.
MaheedharGunturu/mahout
Mirror of Apache Mahout
MaheedharGunturu/metrics_storm
Easy metrics collection for Storm topologies using Coda Hale Metrics
MaheedharGunturu/ml
The Cloudera Data Science Team's Tools for Data Preparation, Machine Learning, and Model Evaluation.
MaheedharGunturu/MLI
An API for Distributed Machine Learning
MaheedharGunturu/oozie
Mirror of Apache Oozie
MaheedharGunturu/oryx
Simple real-time large-scale machine learning infrastructure.
MaheedharGunturu/PredictionIO
PredictionIO, a machine learning server for software developers and data engineers.
MaheedharGunturu/pymadlib
A Python wrapper for MADlib (http://madlib.net) - an open source library for scalable in-database machine learning algorithms
MaheedharGunturu/Rhombus
A time-series object store for Cassandra that handles all the complexity of building wide row indexes.