/doremi

PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets.

Primary Language: HTML
License: MIT
