An implementation of locality sensitive hashing
Likelike is an implementation of LSH (locality sensitive hashing) on Hadoop. This program can be used for the nearest neighbor extraction or item recommendation in E-commerce sites. Currently Likelike supports only Min-Wise independent permutations. Min-Wise independent permutations is applied to the recommendation of Google News (Das et al. 2007).
- 2013-01-01 port the code into Github
- 2011-09-26 gave a presentation in the Hadoop conference Japan Fall (slides)
- 2011-08-22 likelike 0.3.0 released switched the build system from Ant into Maven.
- 2010-05-06 likelike 0.2.0 released
Likelike can be applied to following tasks.
- item recommendation in E-commerce sites
- similar documents (or images) extraction
- news recommendation from click through log data
Begin with the Likelike quick start page (QuickStart) which provides the information on the installation and tutorial with small input files. For detailed usage, please visit the page, Usage.
- provide detailed usage page
- provide server function
- nsupport feature selection
- performance tuning
Takahiko Ito <takahiko03 --at-- gmail.com>
- Edith Cohen. Size-Estimation Framework with Applications to Transitive Closure and Reachability. Jour. of Computerand System Sciences. 1997.
- Abhinandan S. Das, Mayur Datar, Ashutosh Garg and Shyam Rajaram, Google news personalization: scalable online collaborative filtering, In: Proc. of WWW conference, 2007.
- Indyk Piotr and Motwani Rajeev. Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality. In: Proc. of Symposium on Theory of Computing. 1998.