/document-similarity

Tool to identify similar documents within a corpus and decide whether to keep or remove them
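
The description above does not say which similarity measure the tool uses. As a rough illustration of the general idea only, here is a minimal sketch that assumes Jaccard similarity over word shingles and a fixed threshold. The names `shingles`, `jaccard`, `find_duplicates`, and the `threshold` and `k` parameters are hypothetical and are not taken from this repository.

```python
import re
from itertools import combinations


def shingles(text, k=3):
    """Return the set of k-word shingles in a lowercased text (hypothetical helper)."""
    words = re.findall(r"\w+", text.lower())
    if len(words) < k:
        # Short texts become a single shingle; empty texts yield no shingles.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets; defined as 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def find_duplicates(docs, threshold=0.5, k=3):
    """Return sorted indices of documents to remove.

    Assumption: when two documents are at least `threshold` similar,
    the earlier one is kept and the later one is marked for removal.
    """
    sets = [shingles(d, k) for d in docs]
    to_remove = set()
    for i, j in combinations(range(len(docs)), 2):
        if i in to_remove or j in to_remove:
            continue
        if jaccard(sets[i], sets[j]) >= threshold:
            to_remove.add(j)
    return sorted(to_remove)


docs = [
    "the quick brown fox jumps over the lazy dog",
    "the quick brown fox jumps over the lazy cat",
    "completely unrelated text about document corpora",
]
print(find_duplicates(docs))
```

The pairwise comparison is quadratic in the number of documents, so a sketch like this only suits small corpora. Larger ones typically need an approximate method such as MinHash.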

Primary Language: Jupyter Notebook
License: Apache License 2.0 (Apache-2.0)
