Original paper use RNN(LSTM?), here we use transformer like Attention Is All You Need
- bleu: 0.9665[total: 1.000]
- BLEU: 47.75[total: 54.59]
python ./src/train.py
Original paper use RNN(LSTM?), here we use transformer like Attention Is All You Need
python ./src/train.py