Pinned Repositories
elephant-bird
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
FileSetInputFormat
A Hadoop input format for sending lists of files as keys to a mapper. Set the list of files, and an input split will be created per file. Each map task gets only one input key: the filename for its split.
hadoop-lzo
Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
IntegerListInputFormat
An input format for divvying up a range of input values to Hadoop mappers. Set the min, max, and number of splits, and each mapper will get an approximately equal number of input values.
pig.tmbundle
Simple syntax highlighting for writing Pig scripts (http://hadoop.apache.org/pig) in Textmate.
piglet
Piglet is a DSL for writing Pig scripts in Ruby
protobuf.tmbundle
Simple syntax highlighting for working with Protocol Buffers (http://code.google.com/p/protobuf) in Textmate.
scribe
Scribe is a server for aggregating log data streamed in real time from a large number of servers. It is designed to be scalable, extensible without client-side modification, and robust to failure of the network or any specific machine.
smile
actor-based memcache client library
stream-to-hdfs
A simple utility for streaming stdin to a file in HDFS
kevinweil's Repositories
kevinweil/hadoop-lzo
Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
kevinweil/pig.tmbundle
Simple syntax highlighting for writing Pig scripts (http://hadoop.apache.org/pig) in Textmate.
kevinweil/protobuf.tmbundle
Simple syntax highlighting for working with Protocol Buffers (http://code.google.com/p/protobuf) in Textmate.
kevinweil/stream-to-hdfs
A simple utility for streaming stdin to a file in HDFS
kevinweil/elephant-bird
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
kevinweil/FileSetInputFormat
A Hadoop input format for sending lists of files as keys to a mapper. Set the list of files, and an input split will be created per file. Each map task gets only one input key: the filename for its split.
kevinweil/scribe
Scribe is a server for aggregating log data streamed in real time from a large number of servers. It is designed to be scalable, extensible without client-side modification, and robust to failure of the network or any specific machine.
kevinweil/IntegerListInputFormat
An input format for divvying up a range of input values to Hadoop mappers. Set the min, max, and number of splits, and each mapper will get an approximately equal number of input values.
kevinweil/piglet
Piglet is a DSL for writing Pig scripts in Ruby
kevinweil/smile
actor-based memcache client library
kevinweil/fuzzy_text_matcher
Emulate Textmate's cmd+T for arbitrary lists
kevinweil/Google-Visualization-Graph-Fail
An example showing a Google Viz API area chart that renders in Safari but not Firefox.
kevinweil/libra
Libra’s mission is to enable a simple global currency and financial infrastructure that empowers billions of people.
kevinweil/mootools-datepicker
Smoothly animating, very configurable and easy to install. No Ajax, pure Javascript.
kevinweil/Pingdom-SOAP-API
A reminder to myself about how to work with (shudder) WSDL, SOAP, and Java.
kevinweil/malloy
Malloy is an experimental language for describing data relationships and transformations.