This repository is not active
doudi25/SwiGlu-manual-implementation
i implemented the SwiGLu used in the feedforward method of Llama2 , and i apply its gradient manually
Jupyter Notebook
i implemented the SwiGLu used in the feedforward method of Llama2 , and i apply its gradient manually
Jupyter Notebook
This repository is not active