/MiniALBERT

This repository contains the code used for training/fine-tuning the models introduced in the paper "MiniALBERT: Model Distillation via Parameter-Efficient Recursive Transformers".

Primary LanguagePythonMIT LicenseMIT

Watchers