BPE and various normalization methods in deep learning
Albert-Ma commented
Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2016; the BPE paper)
dropout
batch normalization
layer normalization
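For reference, a rough pure-Python sketch of how the three items above differ. It assumes a 2-D input shaped (batch, features) and leaves out the learnable scale/shift parameters and the running statistics that real batch normalization keeps for inference. The function names and the `eps` default are my own choices, not taken from any library.

```python
import math
import random


def _normalize(values, eps=1e-5):
    """Shift to zero mean and scale to (roughly) unit variance."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return [(v - mean) / math.sqrt(var + eps) for v in values]


def batch_norm(x, eps=1e-5):
    """Normalize each feature (column) across the batch dimension."""
    cols = [_normalize(list(col), eps) for col in zip(*x)]
    return [list(row) for row in zip(*cols)]


def layer_norm(x, eps=1e-5):
    """Normalize each example (row) across its own features."""
    return [_normalize(row, eps) for row in x]


def dropout(row, p=0.5, training=True, rng=random):
    """Inverted dropout: zero each unit with probability p (p < 1) and
    rescale the survivors by 1 / (1 - p); identity at inference time."""
    if not training or p == 0:
        return list(row)
    keep = 1.0 - p
    return [v / keep if rng.random() < keep else 0.0 for v in row]


x = [[1.0, 2.0, 3.0], [4.0, 6.0, 8.0]]
bn = batch_norm(x)  # statistics computed per column, over the batch
ln = layer_norm(x)  # statistics computed per row, over the features
```

The only difference between `batch_norm` and `layer_norm` here is the axis the statistics are taken over, which is why layer normalization does not depend on batch size.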
Albert-Ma commented
subword tokenization: https://zhuanlan.zhihu.com/p/38546218
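A minimal sketch of the BPE merge-learning loop as I understand it from the paper: start from characters plus an end-of-word marker, then repeatedly merge the most frequent adjacent symbol pair. This is not the authors' reference code. The helper names are hypothetical, the toy word counts are for illustration only, and ties are broken by whichever pair was counted first.

```python
from collections import Counter


def get_pair_counts(vocab):
    """Count adjacent symbol pairs, weighted by word frequency."""
    pairs = Counter()
    for symbols, freq in vocab.items():
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return pairs


def merge_pair(pair, vocab):
    """Replace every occurrence of `pair` with one merged symbol."""
    merged = {}
    for symbols, freq in vocab.items():
        out, i = [], 0
        while i < len(symbols):
            if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
                out.append(symbols[i] + symbols[i + 1])
                i += 2
            else:
                out.append(symbols[i])
                i += 1
        key = tuple(out)
        merged[key] = merged.get(key, 0) + freq
    return merged


def learn_bpe(word_freqs, num_merges):
    """Return the learned merge list and the final segmented vocabulary."""
    # Each word starts as a tuple of characters plus an end-of-word marker.
    vocab = {tuple(word) + ("</w>",): freq for word, freq in word_freqs.items()}
    merges = []
    for _ in range(num_merges):
        pairs = get_pair_counts(vocab)
        if not pairs:
            break
        best = max(pairs, key=pairs.get)  # first-seen pair wins ties
        vocab = merge_pair(best, vocab)
        merges.append(best)
    return merges, vocab


# Toy corpus (illustrative word counts only).
word_freqs = {"low": 5, "lower": 2, "newest": 6, "widest": 3}
merges, vocab = learn_bpe(word_freqs, 3)
```

To segment a new word, the learned merges would be applied in the same order to its character sequence; `num_merges` is the main hyperparameter and controls the final subword vocabulary size.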