/GPT_Neo_fine-tuning_notebook

Primary LanguageJupyter NotebookGNU Affero General Public License v3.0AGPL-3.0

GPT_Neo_fine-tuning_notebook

This notebook walks through a more conventional way of fine-tuning GPT Neo before going on to use DeepSpeed to fine-tune the larger GPT Neo models.

Youtube video walkthrough can be found here