/transformers_from_scratch

My own implementation of the transformer (for educational purposes)

Primary LanguagePython

transformers_from_scratch

My own implementation of the transformer (for educational purposes). Includes visualizations of the attention heads in action.

Usage

Run encode_data.py to encode a folder of text data

Run minature.py to train

Run vizualize.py to create visualizations