andrewclegg's Stars
dropwizard/dropwizard
A damn simple library for building production-ready RESTful web services.
twitter-archive/kestrel
simple, distributed message queue system (inactive)
addthis/stream-lib
Stream summarizer and cardinality estimator.
epfldata/squall
A streaming / online query processing / analytics engine based on Apache Storm
balajiln/mondrianforest
Code for Mondrian Forests (for classification and regression)
ogrisel/pignlproc
Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.
andrewclegg/sketchy
Simple approximate-nearest-neighbours in Python using locality sensitive hashing.
crate/elasticsearch-timefacets-plugin
Elasticsearch Timebased Facets
tamtam180/CityHash-For-Java
CityHashのJava実装 / same as v1.0.0 version, not same v1.1.1
andrewclegg/pig-data-mining-talk
Notes and resources for my talk at the Hadoop UK Users' Group in June 2012
mhausenblas/tride
turning tabular data into entities
cestella/SpatialSearch
Uses Locality Sensitive Hashing to provide a spatial search on top of any distributed or non-distributed key-value store
andrewclegg/datatools
Command line tools for data analysis
andrewclegg/elasticsearch-ls-plugins
Lovely Systems Elasticsearch Plugins
andrewclegg/maven-clojure-template
andrewclegg/qumran
A Clojure wrapper for the Rome syndication library.
andrewclegg/try_git
andrewclegg/xrip
xrip -- pull fields out of large XML documents efficiently.