/similarity-check

Simple Python Script to Check Similarity Index on Documents using Cosine Similarity

Primary Language: Python

About

A simple Python script that computes a similarity index between one document and the rest of a corpus using cosine similarity.
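For readers new to the metric, here is a minimal sketch of cosine similarity over plain term counts, plus a top-n ranking. It uses only the standard library so it runs anywhere; it is not the script's actual code, which relies on sklearn. The function names `cosine_similarity` and `most_similar` and the whitespace tokenization are assumptions made for illustration.

```python
import math
from collections import Counter


def cosine_similarity(text_a: str, text_b: str) -> float:
    """Cosine of the angle between the term-count vectors of two texts."""
    # Assumption: lowercase + whitespace split stands in for real tokenization.
    counts_a = Counter(text_a.lower().split())
    counts_b = Counter(text_b.lower().split())
    # Only shared terms contribute to the dot product.
    shared = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[t] * counts_b[t] for t in shared)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document has no direction to compare
    return dot / (norm_a * norm_b)


def most_similar(docs: list[str], index: int, top: int) -> list[tuple[int, float]]:
    """Rank every other document by similarity to docs[index]; keep the top n."""
    scores = [
        (i, cosine_similarity(docs[index], doc))
        for i, doc in enumerate(docs)
        if i != index
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:top]
```

A score near 1 means the two documents use the same words in similar proportions; a score of 0 means they share no terms.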

Dependencies

  1. scikit-learn (imported as sklearn)
  2. PyPDF2

How to Use It

> python similarityChecker.py [-h] -P PATH [-T TOP] -I INDEX

-P : Path to the corpus directory (required)
-T : Number of most similar documents to report (top n, optional)
-I : Index of the document to check (required)
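The usage line above can be reproduced with `argparse`. The following is a sketch, not the script's actual parser: the `dest` names, the integer types, and the default of 5 for `-T` are assumptions, since the README does not state them.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser matching: [-h] -P PATH [-T TOP] -I INDEX."""
    parser = argparse.ArgumentParser(prog="similarityChecker.py")
    parser.add_argument("-P", dest="path", required=True,
                        help="path to the corpus directory")
    # Assumption: the real default for -T is not documented; 5 is a placeholder.
    parser.add_argument("-T", dest="top", type=int, default=5,
                        help="number of most similar documents to report")
    parser.add_argument("-I", dest="index", type=int, required=True,
                        help="index of the document to check")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.path, args.top, args.index)
```

Marking `-P` and `-I` as required is what makes them appear without brackets in the generated usage text, while `-T` stays bracketed as optional.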

TODO

  1. Display the index of each document in the corpus