This is an English-Malayalam parallel corpus containing around 4 lakh (400,000) sentence pairs. The English sentences come from the COCO dataset and were translated into Malayalam using the Google API.
Check the Releases section to download the processed data.
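Once the processed data is downloaded, the sentence pairs can be read in a few lines of Python. The sketch below is only an illustration and assumes the release provides two line-aligned UTF-8 text files, one sentence per line. The file names `coco_en.txt` and `coco_ml.txt` and the helper `read_parallel` are hypothetical, so adjust them to whatever the release actually contains.

```python
from itertools import islice
from pathlib import Path
from typing import Iterator, Tuple


def read_parallel(en_path: str, ml_path: str) -> Iterator[Tuple[str, str]]:
    """Yield (English, Malayalam) sentence pairs from two line-aligned files.

    Assumption: line i of the English file corresponds to line i of the
    Malayalam file. This layout is not confirmed by the release itself.
    """
    with open(en_path, encoding="utf-8") as en_file, \
            open(ml_path, encoding="utf-8") as ml_file:
        # zip stops at the end of the shorter file.
        for en_line, ml_line in zip(en_file, ml_file):
            en, ml = en_line.strip(), ml_line.strip()
            if en and ml:  # skip pairs where either side is empty
                yield en, ml


if __name__ == "__main__":
    # Hypothetical file names; replace with the files from the release.
    en_file, ml_file = "coco_en.txt", "coco_ml.txt"
    if Path(en_file).exists() and Path(ml_file).exists():
        # Print the first three pairs as a quick sanity check.
        for en, ml in islice(read_parallel(en_file, ml_file), 3):
            print(en, "\t", ml)
```

Reading lazily with a generator keeps memory use low, which helps with a corpus of this size. If the release instead ships a single delimited file, the same pairing idea applies with the `csv` module.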
COCO English-Malayalam parallel corpus containing 3.6 lakh (360,000) sentences