/Attention-Is-All-You-Need--reproduced

Coded transformer architecture from scratch and created modular framework for experimenting, with suppport for training, hyperparameter search, and serverless deployment.

Primary LanguagePython

Stargazers