transformer_generalization

The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We significantly improve the systematic generalization of transformer models on a variety of datasets using simple tricks and careful considerations.

Primary language: Python. License: MIT.
