Benchmarking Multidomain English-Indonesian Machine Translation

This repository contains the training and evaluation splits for our paper, Benchmarking Multidomain English-Indonesian Machine Translation.

Data for the conversational domain is currently on hold pending legal review. As an alternative, you can download the OpenSubtitles data from OPUS:

http://opus.nlpl.eu/download.php?f=OpenSubtitles/v2018/moses/en-id.txt.zip
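A minimal sketch of how the downloaded archive might be read into sentence pairs follows. It assumes the zip uses the usual Moses layout, with one plain-text file per language whose names end in `.en` and `.id` and whose lines are aligned one-to-one. The exact member names are not stated in this README, so check them before relying on this. The helper names `download` and `iter_sentence_pairs` are hypothetical and are not part of this repository.

```python
import io
import urllib.request
import zipfile

# URL taken from this README.
OPUS_URL = (
    "http://opus.nlpl.eu/download.php"
    "?f=OpenSubtitles/v2018/moses/en-id.txt.zip"
)


def download(url=OPUS_URL, dest="en-id.txt.zip"):
    """Fetch the archive to a local path (hypothetical helper)."""
    urllib.request.urlretrieve(url, dest)
    return dest


def iter_sentence_pairs(archive, src_suffix=".en", tgt_suffix=".id"):
    """Yield (english, indonesian) pairs from a Moses-format zip.

    `archive` is a path or a binary file-like object. The suffixes are
    assumptions about the member names inside the zip.
    """
    with zipfile.ZipFile(archive) as zf:
        names = zf.namelist()
        # Pick the first member matching each language suffix.
        src_name = next(n for n in names if n.endswith(src_suffix))
        tgt_name = next(n for n in names if n.endswith(tgt_suffix))
        with zf.open(src_name) as src_raw, zf.open(tgt_name) as tgt_raw:
            src = io.TextIOWrapper(src_raw, encoding="utf-8")
            tgt = io.TextIOWrapper(tgt_raw, encoding="utf-8")
            # Line i of the source file is paired with line i of the target.
            for src_line, tgt_line in zip(src, tgt):
                yield src_line.rstrip("\n"), tgt_line.rstrip("\n")
```

For example, `for en, idn in iter_sentence_pairs(download()): ...` would stream the pairs without extracting the archive to disk. Streaming is used here because the corpus is large; loading everything into a list may not fit in memory.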