/transformer_bashnick

A from-scratch implementation of a simple GPT-like model, based on the paper "Attention Is All You Need".

Primary language: Python. License: MIT.
