/selective-language-modeling

An implementation of selective language modeling (SLM) loss from "Rho-1: Not All Tokens Are What You Need".
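As a rough illustration of the idea, and not this repository's actual code, SLM as described in the Rho-1 paper scores each token by its excess loss (the training model's loss minus a frozen reference model's loss) and averages the training loss only over the highest-scoring tokens. The sketch below uses plain Python lists to stay dependency-free. The function name `slm_loss`, its arguments, and the default `keep_ratio` are assumptions made for this example.

```python
import math


def slm_loss(train_losses, ref_losses, keep_ratio=0.6):
    """Mean training loss over the tokens with the highest excess loss.

    train_losses / ref_losses: per-token cross-entropy values from the model
    being trained and from a frozen reference model (hypothetical inputs).
    keep_ratio: fraction of tokens to keep (an assumed default).
    Returns (loss, selected_indices).
    """
    if len(train_losses) != len(ref_losses):
        raise ValueError("loss sequences must have the same length")
    if not train_losses:
        return 0.0, []

    # Excess loss: how much worse the training model is than the reference.
    excess = [t - r for t, r in zip(train_losses, ref_losses)]

    # Number of tokens to keep, with a floor of one token.
    k = max(1, math.ceil(keep_ratio * len(excess)))

    # Indices of the k tokens with the largest excess loss.
    ranked = sorted(range(len(excess)), key=lambda i: excess[i], reverse=True)
    selected = ranked[:k]

    # Average the training loss over the selected tokens only.
    loss = sum(train_losses[i] for i in selected) / k
    return loss, sorted(selected)
```

In a real training loop the same selection would typically be done on tensors, with the unselected tokens masked out of the loss so they contribute no gradient.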

Primary language: Python

License: MIT
