Introduction

This project is created mainly to understand the transformer model of Neural Machine Translation. The most modules of the code has been taken from- http://nlp.seas.harvard.edu/2018/04/03/attention.html

Pre-requisites

Train toy model on synthesized data.

$python train_toy.py

Train a real de-en model using default configuration. To download the prepared data use the following spacy command.

$python -m spacy download en
$python -m spacy download de

$python train.py