Various NLP models

I write the NLP models from :

  1. bidirectional GRU encoder and GRU decoder with feedforward attention mechanism.
  2. Standared Transformer.
  3. BERT model.
  4. Transformer-XL model.
  5. XL-Net model. All of those models are written by Tensorflow-2.0 beta