togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
PythonApache-2.0
Watchers
- adbarBerlin-Brg. Academy of Sciences (BBAW)
- BinhangYuanHKUST
- blackconKorea
- brataoEscavador
- caix
- canstralian@Dotcomhunters
- christopherok
- craigschmidtWellesley, MA
- crapthingsHarbin, China
- csrisTogether AI
- ctejada85
- damaruBangalore
- danielpclark6ft Dan(TM)
- gradetwo
- hhhaiai@analysys
- ishandutta2007India
- JohnnyOpcodeToronto, Ontario, Canada
- jrmuizelToronto, Canada
- krandiashSan Francisco
- lgshttp://www.linkedin.com/in/lucasoave
- mbofb
- nsl2014fm
- OleNet
- percyliangStanford University
- QubitiumModelCloud.ai
- realworlds46
- sodabeta7Apple
- sukuyaRakuten Group Inc.
- tiendung
- tjadamleeBeijing
- trappedinspacetimeFor Personal Use
- trycatcher
- vipulved@togethercomputer
- wgwangShanghai, China
- winning1120xx
- xmzhaoTencent