GPT from scratch

This isn't a recreation of a specific model, just an exercise in writing a small language model.