Mismatch between Figure 3a and Equation 5 in paper
krasserm opened this issue · 1 comments
krasserm commented
Thank you for the very interesting paper and your plan to release the code. Since there is no initial code release yet (at the time of opening this issue), I have an implementation-related question: the lightweight transformer layer
whereas Figure 3a looks more like
Which one is correct i.e. is used in the implementation?
andydelworth commented
I am also very interested in the answer to this question