Label-smoothed aggregation cross-entropy loss for generalisation in sequence-to-sequence tasks.
Primary language: Python. License: MIT.
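
To make the loss named in the description concrete, here is a minimal pure-Python sketch. It is not this repository's code. It assumes the common formulation of aggregation cross-entropy: per-timestep class probabilities are summed over time and normalised by the sequence length, then compared with the normalised per-class label counts. It also assumes class index 0 is the blank, and that label smoothing mixes the target with a uniform distribution. The function name `smoothed_ace_loss` and its parameters are hypothetical.

```python
import math


def smoothed_ace_loss(probs, counts, smoothing=0.1, eps=1e-10):
    """Sketch of a label-smoothed aggregation cross-entropy loss.

    probs:  T rows of K class probabilities (assumption: index 0 is the blank).
    counts: K target counts; counts[0] is ignored and recomputed as the blank count.
    """
    T = len(probs)
    K = len(probs[0])

    # Aggregate predictions over time and normalise so they sum to 1.
    aggregated = [sum(row[k] for row in probs) / T for k in range(K)]

    # The blank absorbs every timestep not used by a real label.
    blank = T - sum(counts[1:])
    if blank < 0:
        raise ValueError("more labels than timesteps")
    target = [blank / T] + [n / T for n in counts[1:]]

    # Label smoothing: blend the target with a uniform distribution over K classes.
    target = [(1.0 - smoothing) * t + smoothing / K for t in target]

    # Cross-entropy between the smoothed target and the aggregated prediction.
    return -sum(t * math.log(a + eps) for t, a in zip(target, aggregated))


# Two timesteps, three classes (blank, class 1, class 2); one label of each class.
example_probs = [[0.1, 0.8, 0.1], [0.1, 0.1, 0.8]]
example_counts = [0, 1, 1]
plain = smoothed_ace_loss(example_probs, example_counts, smoothing=0.0)
smoothed = smoothed_ace_loss(example_probs, example_counts, smoothing=0.1)
```

Because the loss depends only on how often each class occurs, not on where, it needs no alignment between timesteps and labels. With smoothing enabled, a little target mass goes to every class, including the blank, which penalises overconfident aggregated predictions.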