/MinHashing_Spark

In this repo. , I implement Cosine similarity and MinHashing function ( with and / or operator on band ) to find similarity to specific road in real Traffic dataset using PySpark.

Primary LanguageJupyter NotebookMIT LicenseMIT

Stargazers