thammegowda/mtdata

Add parallel bible corpus

Opened this issue · 1 comments

First appeared here in https://aclanthology.org/L14-1215/

which references link: http://paralleltext.info/data/

but that link is no longer available.

However, recently https://arxiv.org/pdf/2109.05772.pdf mention that they used it. In their own words:

The Parallel Bible Corpus (PBC) by Mayer and Cysouw (2014) is a multi-parallel corpus spanning 1259 languages and up to 30k verses per translation.

TODO: find a download link and include in our index

I learned via my connections in the bible translation community that bible translations are copyrighted.
So we are probably not going to find bible translations for low res langs at the moment.
Tagging this as invalid for now, until someone makes bible translations open.