X-PLUG/mPLUG-Owl

The code and detailed implementation of Figure 4 and Figure 5 in the paper mPLUG-Owl2

Zlatan-Ibrahi opened this issue · 1 comments

I would like to analyze the attention map of my own trained model, but I am not very clear about some details. For example, do we take the average of the attention maps across multiple heads? Could you provide the code for this?

same question, any solutions?