Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
Primary LanguageHTMLMIT LicenseMIT
No one’s watching this repository yet.