Transformer-network-with-pytorch

This is an implementation of transformer network based on "attention is all you need" paper from scratch.