As the name suggests, this is yet another Hive UDF library. It consists of algorithms that don't already exist in HiveSwarm, brickhouse, etc.
Levenshtein Distance
Damerau-Levenshtein Distance
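For readers unfamiliar with the two metrics, here is a minimal plain-Java sketch of what they compute. It is not this library's implementation: the class name `EditDistanceSketch` and both method names are hypothetical, and the second method assumes the restricted "optimal string alignment" variant of Damerau-Levenshtein, which may differ from the variant the library uses.

```java
// Illustrative sketch only; class and method names are hypothetical,
// not taken from this library.
public final class EditDistanceSketch {
    private EditDistanceSketch() {}

    /** Levenshtein distance: minimum insertions, deletions and substitutions. */
    public static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        // Turning the empty prefix of a into b[0..j) takes j insertions.
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // deleting i characters of a yields the empty string
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(
                        Math.min(curr[j - 1] + 1, prev[j] + 1), // insert, delete
                        prev[j - 1] + cost);                    // substitute or match
            }
            int[] tmp = prev; // reuse the two rows instead of a full matrix
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    /**
     * Restricted Damerau-Levenshtein (optimal string alignment, an assumption):
     * like Levenshtein, plus swapping two adjacent characters as a single edit.
     */
    public static int damerauLevenshtein(String a, String b) {
        int[][] d = new int[a.length() + 1][b.length() + 1];
        for (int i = 0; i <= a.length(); i++) {
            d[i][0] = i;
        }
        for (int j = 0; j <= b.length(); j++) {
            d[0][j] = j;
        }
        for (int i = 1; i <= a.length(); i++) {
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                d[i][j] = Math.min(
                        Math.min(d[i - 1][j] + 1, d[i][j - 1] + 1),
                        d[i - 1][j - 1] + cost);
                // Adjacent transposition: "ab" -> "ba" counts as one edit.
                if (i > 1 && j > 1
                        && a.charAt(i - 1) == b.charAt(j - 2)
                        && a.charAt(i - 2) == b.charAt(j - 1)) {
                    d[i][j] = Math.min(d[i][j], d[i - 2][j - 2] + 1);
                }
            }
        }
        return d[a.length()][b.length()];
    }
}
```

The difference between the two shows up on swapped neighbours: under these definitions "ab" and "ba" are 2 apart by Levenshtein but 1 apart by Damerau-Levenshtein.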
mvn compile
mvn test
mvn assembly:single
%> hive
hive> ADD JAR target/NAME_OF_ASSEMBLED.jar;
hive> SOURCE sourceAll.hql;
hive> select ldistance(full_name, first_name) from people limit 10;
-> This great article walks through creating Java UDFs in Hive.