A 🤗-style implementation of BERT using lambda layers instead of self-attention
Primary LanguagePythonMIT LicenseMIT