/Document-Winnowing

Implementation of the plagiarism-detection algorithms behind MOSS

Primary LanguagePython

Document Winnowing

Implementation of the plagiarism-detection algorithms behind MOSS

Coming soon:

Preprocessing logic for real documents, not toy data

Ability to upload documents and check them against a database of collected files

Web interface in Django for hosting above

Pretty much everything - super early stages

See: http://igm.univ-mlv.fr/~mac/ENS/DOC/sigmod03-1.pdf