andrewclegg's Stars
balajiln/mondrianforest
Code for Mondrian Forests (for classification and regression)
andrewclegg/try_git
andrewclegg/pig-data-mining-talk
Notes and resources for my talk at the Hadoop UK Users' Group in June 2012
andrewclegg/elasticsearch-ls-plugins
Lovely Systems Elasticsearch Plugins
andrewclegg/sketchy
Simple approximate-nearest-neighbours in Python using locality sensitive hashing.
mhausenblas/tride
turning tabular data into entities
tamtam180/CityHash-For-Java
Java implementation of CityHash / same as version v1.0.0, not the same as v1.1.1
epfldata/squall
A streaming / online query processing / analytics engine based on Apache Storm
cestella/SpatialSearch
Uses Locality Sensitive Hashing to provide a spatial search on top of any distributed or non-distributed key-value store
crate/elasticsearch-timefacets-plugin
Elasticsearch Timebased Facets
andrewclegg/maven-clojure-template
andrewclegg/qumran
A Clojure wrapper for the Rome syndication library.
andrewclegg/xrip
xrip -- pull fields out of large XML documents efficiently.
addthis/stream-lib
Stream summarizer and cardinality estimator.
andrewclegg/datatools
Command line tools for data analysis
dropwizard/dropwizard
A damn simple library for building production-ready RESTful web services.
ogrisel/pignlproc
Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.
twitter-archive/kestrel
simple, distributed message queue system (inactive)