/DocumentTokenizer

Traces a corpus from text files in order to find similarities between their contents.
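
The general idea can be illustrated with a minimal sketch. This is not the repository's code: the function names `tokenize`, `load_corpus`, and `cosine_similarity` are hypothetical, and cosine similarity over raw term frequencies is only an assumed stand-in for whatever similarity measure the project actually used.

```python
import math
import re
from collections import Counter
from pathlib import Path


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def load_corpus(paths):
    """Map each file path to the list of tokens read from that file.

    Hypothetical helper: assumes the files are UTF-8 encoded plain text.
    """
    return {str(p): tokenize(Path(p).read_text(encoding="utf-8")) for p in paths}


def cosine_similarity(tokens_a, tokens_b):
    """Cosine similarity between the term-frequency vectors of two token lists."""
    freq_a, freq_b = Counter(tokens_a), Counter(tokens_b)
    # Only tokens present in both documents contribute to the dot product.
    dot = sum(freq_a[t] * freq_b[t] for t in freq_a.keys() & freq_b.keys())
    norm_a = math.sqrt(sum(c * c for c in freq_a.values()))
    norm_b = math.sqrt(sum(c * c for c in freq_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # An empty document is treated as similar to nothing.
    return dot / (norm_a * norm_b)
```

Under these assumptions, `load_corpus(["a.txt", "b.txt"])` would return the tokens per file, and passing two of those token lists to `cosine_similarity` gives a score between 0.0 (no shared tokens) and 1.0 (identical term distributions).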

This repository is not active.