Super simple python script to check similarity index using cosine similarity
- sklearn
- PyPDF2
> python similarityChecker.py [-h] -P PATH [-T TOP] -I INDEX
-P : Path to corpus directory
-T : Number of top n identical documents
-I : The index of to be checked document
- Display index of corpus