An easy but opinionated way to train GPT models.
- A way to load a model
- Hyperparameter settings with defaults from good papers
- A way to use datasets directly from huggingface
- A way to monitor training
- Should be able to load a model from anywhere...
- This might be difficult, for now let's leave it as pasting your model code in this repository and then using it to create a script