/cs246_mining_massive_datasets

Yulong's notes on CS246: Mining Massive Data Sets

Primary language: Jupyter Notebook

Course

CS246: Mining Massive Data Sets

YouTube list

Schedule

| topic | start date | note |
| --- | --- | --- |
| L3 - Locality Sensitive Hashing | 5/31, 21 | Review 6/2, 21 |
| L4 - Theory of LSH | 6/6, 21 | |

Code Study

| topic | name | start date | note |
| --- | --- | --- | --- |
| lsh | lsh cosine by random projection | 20210606 | |
| lsh | lsh, yelp near duplicate review detection cosine | 20210601 | |

Reference

PiotrTa/Mining-Massive-Datasets 8+ stars

LSH random projection code

LSH - random projection post

LSH - Jaccard implementation github

random projection vs pca

Bucketed Random Projection LSH algorithm: Spark + Hadoop + Redis implementation