/building-gpt

Primary LanguagePythonMIT LicenseMIT

Building a GPT model

This is a repository used to learn and train a Generatively Pretrained Transformer (GPT). It is an educational process that allow one to comprehend how a basic text file can be converted into a transformer model that can perform conversational task. Basically, you learn to understand the properties of what a GPT is.

Requirements

This uses a conda environment.

Start

Start by going through building-gpt.ipynb.