/cramming

Cramming the training of a (BERT-type) language model into limited compute.

Primary LanguagePythonMIT LicenseMIT

Stargazers

No one’s star this repository yet.