similarity-engine

A Hadoop-based system that uses locality-sensitive hashing (LSH) and random projection to measure similarity between items.
It is built on Apache Pig and Python user-defined functions (UDFs).
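
To make the idea concrete, here is a minimal, standalone sketch of random-projection LSH for cosine similarity. It is not taken from this project's code: the function names (`make_hyperplanes`, `signature`, `estimated_cosine`) and parameters are hypothetical, and it uses only the Python standard library. Each item vector is reduced to a short bit signature by taking the sign of its dot product with a set of random hyperplanes; the fraction of differing bits between two signatures then approximates the angle between the original vectors.

```python
import math
import random


def make_hyperplanes(num_bits, dim, seed=0):
    """Draw num_bits random Gaussian hyperplanes of dimension dim (illustrative helper)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def signature(vector, hyperplanes):
    """Return one bit per hyperplane: 1 if the vector lies on its non-negative side."""
    return tuple(
        1 if sum(h_i * v_i for h_i, v_i in zip(plane, vector)) >= 0 else 0
        for plane in hyperplanes
    )


def hamming(sig_a, sig_b):
    """Count the positions where two signatures disagree."""
    return sum(a != b for a, b in zip(sig_a, sig_b))


def estimated_cosine(sig_a, sig_b):
    """Estimate cosine similarity from the fraction of differing bits."""
    theta = math.pi * hamming(sig_a, sig_b) / len(sig_a)
    return math.cos(theta)


if __name__ == "__main__":
    planes = make_hyperplanes(num_bits=256, dim=3, seed=42)
    a = [1.0, 2.0, 3.0]
    b = [1.1, 1.9, 3.2]   # close to a
    c = [-3.0, 0.5, -1.0]  # points in a very different direction
    print(estimated_cosine(signature(a, planes), signature(b, planes)))
    print(estimated_cosine(signature(a, planes), signature(c, planes)))
```

More bits give a tighter estimate at the cost of longer signatures. In a Hadoop setting, a function like `signature` would presumably be the part wrapped as a UDF so that signatures can be computed per record and grouped by bucket, but that mapping is an assumption here rather than a description of this repository.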