A Simple Implementation for "Beyond neural scaling laws: beating power law scaling via data pruning"
Primary LanguagePython