document-aligner
There are 2 repositories under document-aligner topic.
bitextor/bitextor
Bitextor generates translation memories from multilingual websites
transducens/parallel-urls-classifier
Parallel URLs Classifier (PUC) infers the parallelness of a pair of documents from their URLs