This is a research project, not an official NVIDIA product.
- Sequence to sequence learning
- Different cell types: LSTM, GRU, GLSTM, SLSTM
- Encoders: RNN-based, unidirectional, bi-directional, GNMT-like
- Attention mechanisms: Bahdanau, Luong, GNMT-like
- Beam search for inference
- Data parallel multi-gpu training
- Distributed (data-parallel) multi-node training using Horovod
- LARS norm scaling algorithm