sangmichaelxie/doremi
Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
HTMLMIT
Stargazers
- zero91
- Haijunlv
- winglianAnnapolis, MD
- duyonganSeattle
- glfpesShanghai, China
- ShadowTeamCN
- singleheartSeoul, Korea
- hjsg1010
- 2003proClear Water Bay, Sai Kung, Hong Kong
- spachava753
- zhaojiong233
- Ryan0v0
- cfeng16Ann Arbor, MI, US
- yliuhzHong Kong SAR
- dqxiuHaidian, Beijing
- we1l1n
- yds1024
- Sirius222
- wangyuxin87Anhui, Hefei
- mengxj08Singapore
- 18907305772Shenzhen
- justHungryMan
- leemengtwTokyo, Japan
- Bowen-nShanghai China
- CHENhush
- kevinng77China
- superqing001
- frt03
- GanjinZeroBeijing
- kiseliuGuangZhou
- ctlllllEarth
- Ryu1845
- rt3722
- VikParuchuriOakland, CA
- abodacs
- progerSupercomputer City