/regmix

🧬 RegMix: Data Mixture as Regression for Language Model Pre-training

Primary LanguageJupyter NotebookMIT LicenseMIT