Mahesh3394/training-of-transformer-on-dummy-data
Here we try to understand how transformer works and try to replicate architecture from paper published. Also we will train simple architecture on dummy dataset.
Jupyter NotebookApache-2.0
Here we try to understand how transformer works and try to replicate architecture from paper published. Also we will train simple architecture on dummy dataset.
Jupyter NotebookApache-2.0