Hindi-Urdu-Machine-Translation

A phrase-based machine translation model and several neural machine translation models for translating between Hindi and Urdu.

Approach:

We tried both statistical and neural machine translation models for this task. The dataset cannot be made public because it was part of a contained study, but the models are not specific to Hindi and Urdu and can be trained on any pair of languages.

Statistical Machine Translation (SMT) model:

We used a phrase-based SMT model, with GIZA++ providing the word alignments.
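
To illustrate what word alignment involves, here is a minimal sketch of IBM Model 1 trained with EM, the simplest of the alignment models that GIZA++ implements. It is not the pipeline used in this project: the function names (`train_ibm_model1`, `align`) and the toy German-English corpus are assumptions made for illustration, and the sketch omits the NULL word and the higher IBM models.

```python
from collections import defaultdict


def train_ibm_model1(corpus, iterations=10):
    """Estimate t(target_word | source_word) with EM (IBM Model 1, no NULL word).

    corpus: list of (source_tokens, target_tokens) sentence pairs.
    Returns a dict keyed by (target_word, source_word).
    """
    target_vocab = {tw for _, tgt in corpus for tw in tgt}
    # Uniform initialisation of the translation probabilities.
    t = defaultdict(lambda: 1.0 / len(target_vocab))
    for _ in range(iterations):
        count = defaultdict(float)  # expected count of each (target, source) pair
        total = defaultdict(float)  # expected count of each source word
        for src, tgt in corpus:
            for tw in tgt:
                # E-step: share each target word among the source words.
                norm = sum(t[(tw, sw)] for sw in src)
                for sw in src:
                    frac = t[(tw, sw)] / norm
                    count[(tw, sw)] += frac
                    total[sw] += frac
        # M-step: re-normalise the expected counts per source word.
        for (tw, sw), c in count.items():
            t[(tw, sw)] = c / total[sw]
    return t


def align(src, tgt, t):
    """Link each target word to its most probable source word."""
    return [(tw, max(src, key=lambda sw: t[(tw, sw)])) for tw in tgt]


# Hypothetical toy corpus, used only to show the calls.
toy_corpus = [
    ("das haus".split(), "the house".split()),
    ("das buch".split(), "the book".split()),
    ("ein buch".split(), "a book".split()),
]
t_probs = train_ibm_model1(toy_corpus)
links = align("das haus".split(), "the house".split(), t_probs)
```

In a phrase-based system, alignments like `links` are typically computed in both translation directions and merged before phrase pairs are extracted from them.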

Neural Machine Translation (NMT) model:

We implemented four models:

  • A baseline sequence-to-sequence (seq2seq) model using LSTMs
  • Neural Machine Translation By Jointly Learning To Align And Translate (paper link)
  • Effective Approaches to Attention-based Neural Machine Translation (paper link)
  • Modeling Coverage for Neural Machine Translation (paper link)
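
The three papers above differ mainly in how the decoder attends over the encoder states. The following is a rough, framework-free sketch of the ideas, not the code in this repository: it shows an additive score (as in the "Jointly Learning to Align and Translate" paper), a dot-product score (one of the variants in "Effective Approaches"), and a coverage vector in its simplest form, a running sum of attention weights. All function names, weight matrices, and vectors here are hypothetical toy values.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def dot_score(query, key):
    """Dot-product score: query . key."""
    return sum(q * k for q, k in zip(query, key))


def make_additive_score(w_query, w_key, v):
    """Additive score: v . tanh(W_q q + W_k k)."""
    def matvec(mat, vec):
        return [sum(m * x for m, x in zip(row, vec)) for row in mat]

    def score(query, key):
        projected = zip(matvec(w_query, query), matvec(w_key, key))
        hidden = [math.tanh(a + b) for a, b in projected]
        return sum(vi * hi for vi, hi in zip(v, hidden))

    return score


def attend(query, encoder_states, score_fn):
    """Return (attention weights, context vector) for one decoder step."""
    weights = softmax([score_fn(query, h) for h in encoder_states])
    dim = len(encoder_states[0])
    context = [
        sum(w * h[i] for w, h in zip(weights, encoder_states))
        for i in range(dim)
    ]
    return weights, context


def update_coverage(coverage, weights):
    """Accumulate how much attention each source position has received."""
    return [c + w for c, w in zip(coverage, weights)]


# Toy values, assumed for illustration only.
encoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [2.0, 0.0]
identity = [[1.0, 0.0], [0.0, 1.0]]

dot_weights, dot_context = attend(query, encoder_states, dot_score)
additive = make_additive_score(identity, identity, [1.0, 1.0])
add_weights, add_context = attend(query, encoder_states, additive)
coverage = update_coverage([0.0] * len(encoder_states), dot_weights)
```

In the coverage paper, the coverage vector is also fed back into the attention score so that source words already attended to are less likely to be translated again; the sketch only tracks the accumulation.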

A presentation covering the implementation details and the results can be found here.

Built and tested using Python 3 on Linux.

Authors:

Saurabh Chand Ramola, Sumukh S