LSH

very simple LSH implementation in python. main1.py first add files then compare and find similar one. main2.py use datasketch to do recommendation by LSH algorithm.

dataset

simple text file in files folder are used for similarity calculation. bts_2021_1.csv is a dataset of bts comments on youtube used for recommendation part.