Cramming the training of a (BERT-type) language model into limited compute.
Primary LanguagePythonMIT LicenseMIT
No one’s star this repository yet.