sangmichaelxie/doremi
Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
HTMLMIT
Stargazers
- KeepABC
- aleien95
- sepilqiAnonymous
- gawei1995
- virtualzx-nadSan Mateo, CA
- Olivia-fsmLausannne, Switzerland
- CPFLAME
- sudahui
- tytemp
- ncoop57
- L1aoXingyuBeijing, China
- love1life
- Ldpe2GGuang Zhou, China
- MARD1NONeverland
- taisazeroCharlotte, NC
- fly51flyBeiJing
- Sandalots
- TianHongTaoBeiJing
- Daemon-serHefei
- slyviacassell
- evdcush
- alphadlShanghai(CN) & Sydney(AU)
- ZhishuaiGuoCollege Station, TX
- ovbystrovaLondon, UK
- dayyassDubai, UAE
- jammyWolf
- Zhangyunyan77
- Giruvegan
- labixiaoKBeijing
- MarkWuNLPBeijing, China
- whr94621Nanjing, Jiangsu Province, China
- ToSev7en
- BladeSun
- TonyNemoGuangzhou, China
- Moddus
- HuangLK