Various NLP models I write the NLP models from : bidirectional GRU encoder and GRU decoder with feedforward attention mechanism. Standared Transformer. BERT model. Transformer-XL model. XL-Net model. All of those models are written by Tensorflow-2.0 beta