Experiments to distill transformer models to RNN models
Primary LanguageJupyter NotebookMIT LicenseMIT