surajkarki66/tranformer-from-scratch
In this repository, I have tried implementing the state of the art transformer model from scratch, and trained the model for just one epoch to check.
Jupyter Notebook
In this repository, I have tried implementing the state of the art transformer model from scratch, and trained the model for just one epoch to check.
Jupyter Notebook