parallel-data-analysis
There are 2 repositories under the parallel-data-analysis topic.
MultithreadCorner/Hydra
Header-only framework for data analysis on massively parallel platforms.
aritra1804/Cohere-LinguaLink
This project uses a multilingual embedding model to align sentences in one language (preferably a low-resource language) with their likely translations in English. The idea is that if documents in both languages can be crawled online (e.g., from news sites), sentences that are translations of each other can be paired by comparing their embeddings.
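A minimal sketch of the pairing step described above, not the project's actual code. It assumes the sentence embeddings have already been computed by some multilingual model and are available as plain lists of floats. The function names `cosine` and `align_sentences`, the `threshold` parameter, and the toy vectors are all hypothetical and exist only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def align_sentences(src_vecs, tgt_vecs, threshold=0.8):
    """Pair each source sentence with its most similar target sentence.

    Returns (source_index, target_index, score) tuples, keeping only pairs
    whose similarity reaches `threshold` (a hypothetical cut-off).
    """
    pairs = []
    if not tgt_vecs:
        return pairs
    for i, u in enumerate(src_vecs):
        scores = [cosine(u, v) for v in tgt_vecs]
        j = max(range(len(scores)), key=scores.__getitem__)
        if scores[j] >= threshold:
            pairs.append((i, j, scores[j]))
    return pairs


# Toy 3-dimensional "embeddings" standing in for real model output.
src_vecs = [[1.0, 0.0, 0.1], [0.0, 1.0, 0.0], [0.5, 0.5, -1.0]]
tgt_vecs = [[0.0, 0.9, 0.1], [0.9, 0.1, 0.1], [0.0, 0.0, 1.0]]

for i, j, score in align_sentences(src_vecs, tgt_vecs):
    print(f"source {i} <-> target {j} (similarity {score:.3f})")
```

With these toy vectors, the third source sentence has no target above the threshold and is left unpaired, which is the behavior one would want for crawled sentences that have no translation on the other side. A greedy nearest-neighbor match like this can assign the same target to several sources; a real pipeline would likely need a one-to-one constraint or a margin-based score.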