togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
PythonApache-2.0
Watchers
- Ossifragus
- ad12
- andrew-hsiao
- qtvhaoVietnam
- crgone1Cali
- donggyukimcSeoul, Korea
- jensonbSpaceship Earth
- kojikuniLondon, UK
- DKV006
- jfzhang95Singapore
- cyberust
- haikuoxin
- songkq
- mauriceweberZurich
- eemailme
- antocodes
- ZhihongShaoBeijing, China
- ssunqfbeijing
- XReyRobert-IBMFrance
- yotamnahum
- sarakilanyEgypt
- DJJones66
- peiyong-addwater
- hackingco
- thevasudevguptaNew Delhi, India
- kenyandoppio
- kipester
- wx-bPalo Alto, CA
- redrr
- SamimAB
- designfailureSlovenia
- EarthyFox
- hunterm01
- MyDevClouds
- MacJedi42
- CamaradaLares