/LSH_Python

some tricky algorithm regarding Local-Sensitive Hashing Algorithm

Primary LanguagePython

LSH_Python

Those algorithms are for Local-Sensitive Hashing Algorithm and based on UoAuckland COMPSCI 753 course and Stanfrod Uni. Mining of Massive Datasets. They include the following topics:

  1. Document-word shingle matrix
  2. Hashing shingle matrix to signature matrix
  3. Calculate similarity based on the signature matrix. TBC....