/cramming

Cramming the training of a (BERT-type) language model into limited compute.

Primary language: Python
License: MIT
