transformer-nmt

Primary language: Python. License: Apache License 2.0 (Apache-2.0).

Introduction

This project was created mainly to understand the Transformer model for neural machine translation. Most of the code modules are taken from The Annotated Transformer: http://nlp.seas.harvard.edu/2018/04/03/attention.html

Pre-requisites

  • python >= 3.6
  • pytorch >= 0.5
  • numpy
  • torchtext
  • spacy (to train the de-en model)
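A quick way to confirm that the packages listed above are importable is a small check script. This is only a sketch and not part of the repository: the helper names `missing_packages` and `python_ok` are hypothetical, and the script checks presence only, not the minimum versions.

```python
import importlib.util
import sys

# Import names of the packages listed above (PyTorch is imported as "torch").
REQUIRED = ["torch", "numpy", "torchtext", "spacy"]


def missing_packages(names):
    """Return the names that cannot be found on the current import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def python_ok(min_version=(3, 6)):
    """Return True if the running interpreter is at least min_version."""
    return sys.version_info >= min_version


if __name__ == "__main__":
    if not python_ok():
        print("Python 3.6 or newer is required")
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed packages are importable")
```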

Quick Start

Train a toy model on synthesized data:

$ python train_toy.py

or

Train a real de-en model using the default configuration. First download the spaCy English and German language models with the following commands, then start training:

$ python -m spacy download en
$ python -m spacy download de
$ python train.py
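To illustrate what "synthesized data" for the toy model could look like, here is a minimal sketch of a copy task, in which the target sequence is identical to the source. It is an assumption modeled loosely on the tutorial linked in the Introduction, not the actual contents of train_toy.py. The function name `synthetic_copy_batches`, its parameters, and the choice of id 0 for padding and id 1 as a start marker are all hypothetical.

```python
import numpy as np


def synthetic_copy_batches(vocab_size, batch_size, num_batches, seq_len=10, seed=0):
    """Yield (src, tgt) pairs of random token ids for a copy task."""
    rng = np.random.RandomState(seed)
    for _ in range(num_batches):
        # Token ids are drawn from 1..vocab_size-1; id 0 is assumed to be padding.
        data = rng.randint(1, vocab_size, size=(batch_size, seq_len))
        # Assumption: id 1 marks the start of every sequence.
        data[:, 0] = 1
        # Copy task: the target is an exact copy of the source.
        yield data.copy(), data.copy()


if __name__ == "__main__":
    for src, tgt in synthetic_copy_batches(vocab_size=11, batch_size=2, num_batches=1):
        print(src)
        print(tgt)
```

A model that learns this task only has to reproduce its input, which makes it a cheap sanity check before training on real de-en data.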