/cavemangpt

Primary LanguagePythonApache License 2.0Apache-2.0

Caveman GPT

A very basic implementation of the GPT model.

References

@article{radford2019language, title={Language Models are Unsupervised Multitask Learners}, author={Radford, Alec and Wu, Jeff and Child, Rewon and Luan, David and Amodei, Dario and Sutskever, Ilya}, year={2019} }