An exploration of optimized decoding in LLMs using a parallel variation of Gauss-Seidel decoding.
AndreSlavescu/Intermediate-Gauss-Seidel-Decoding
An exploration of optimized decoding in LLMs using a parallel variation of Gauss-Seidel decoding.
Jupyter Notebook