/Spark_SimilarityDocs

A PySpark app which implements a MapReduce algorithm to compute the pairwise document similarity in a large document dataset

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Stargazers

No one’s star this repository yet.