/DSLP

Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation

Primary LanguagePythonMIT LicenseMIT

Stargazers