similarity-engine Hadoop-based locality sensitive hashing and random projection system for measuring similarity between items. Leverages Pig and python-based user defined functions (UDF).