/llm-data

LLM Training Data

Primary LanguageJupyter NotebookMIT LicenseMIT

LLM Data

This repository is mainly about cleaning, converting and checking LLM training datasets.

Datasets

New datasets cleaned and created by this project:

Licensing

Copyright (c) 2024 Philip May

Licensed under the MIT License (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License by reviewing the file LICENSE in the repository.