/LightestTransformer

Although this is not the most powerful model, the self-attention mechanism and Transformer architecture used here were built from scratch.
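For readers unfamiliar with what "from scratch" involves, the following is a minimal sketch of single-head scaled dot-product self-attention in plain Python. It is not taken from this repository: the function names (`softmax`, `matmul`, `transpose`, `self_attention`) and the toy weights are assumptions chosen for illustration only.

```python
import math


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Swap rows and columns of a matrix."""
    return [list(col) for col in zip(*m)]


def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention (illustrative sketch).

    x has one row per token; w_q, w_k, w_v are hypothetical projection matrices.
    Returns the attended outputs and the attention weights.
    """
    q = matmul(x, w_q)
    k = matmul(x, w_k)
    v = matmul(x, w_v)
    d_k = len(k[0])
    scores = matmul(q, transpose(k))
    # Scale by sqrt(d_k), then normalise each row so the weights sum to 1.
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    return matmul(weights, v), weights


# Toy input: three tokens with two features each, and identity projections.
identity = [[1.0, 0.0], [0.0, 1.0]]
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
out, weights = self_attention(x, identity, identity, identity)
```

With identity projections each output row is simply a weighted average of the input tokens, which makes the attention weights easy to inspect by hand.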

Primary Language: Python
