lbendig's Stars
apache/gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
twitter/summingbird
Streaming MapReduce with Scalding and Storm
twitter-archive/ambrose
A platform for visualization and real-time monitoring of data workflows
twitter/elephant-bird
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
lintool/MapReduceAlgorithms
Data-Intensive Text Processing with MapReduce
lintool/Cloud9
Cloud9 is a Hadoop toolkit for working with big data
sevdokimov/RegexUtil
lbendig/mucommander
Supports any HDFS version, Quantcast QFS
eyala/gedit-pig
GtkSourceView syntax highlighting for Apache Pig files
lbendig/mucommander-commons-file