chenjieen/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Python, Apache-2.0
Stargazers
No one has starred this repository yet.