sangmichaelxie/doremi
Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
HTMLMIT
Stargazers
- aleien95
- alphadlJD Explore Academy, JD.com Inc.
- BladeSun
- CPFLAME
- Daemon-serHefei
- dayyassSocial Discovery Group
- evdcush
- fly51flyPRIS
- gawei1995
- Giruvegan
- HuangLKsysu
- jammyWolf
- KeepABC
- L1aoXingyuBeijing, China
- labixiaoKBeijing
- Ldpe2GSun Yat-sen University
- love1life
- MARD1NOSiliconFlow
- MarkWuNLPMicrosoft Research
- Moddus
- ncoop57
- Olivia-fsmEcole Polytech Federal of Lausanne
- ovbystrovaSnap Inc
- SandalotsVolcanak
- sepilqiAnonymous
- slyviacassell
- sudahui
- taisazeroPhD Student - UNC@Charlotte
- TianHongTaoBeiJing
- TonyNemoGuangzhou, China
- ToSev7enNCU
- tytemp
- virtualzx-nadCandidate Labs
- whr94621Nanjing University
- Zhangyunyan77
- ZhishuaiGuoTexas A&M University