ModelTC/QLLM

Is there a bug?

sunnyxiaohu opened this issue · 1 comment

According to line 40, `attn_output = self.pv_matmul(attn_weights, value_states)` is equivalent to `attn_output = torch.matmul(attn_weights, value_states)`. Is that intentional?
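For context, the two calls can indeed be numerically identical in full precision while still not being redundant: quantization frameworks commonly wrap functional ops like `torch.matmul` in an `nn.Module` so that observers or fake-quant hooks can be attached to the P·V product during calibration. The sketch below is a hypothetical illustration of that pattern (the `MatMul` class is an assumption, not QLLM's actual implementation):

```python
import torch
import torch.nn as nn


class MatMul(nn.Module):
    """Hypothetical wrapper: a hookable nn.Module around torch.matmul.

    In full precision this is a no-op wrapper, so calling it matches
    torch.matmul exactly; its value is that quantizers can register
    hooks on it or swap it for a quantized matmul.
    """

    def forward(self, a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
        return torch.matmul(a, b)


# Shapes as in attention: (batch, heads, q_len, k_len) x (batch, heads, k_len, head_dim)
attn_weights = torch.randn(2, 4, 8, 8)
value_states = torch.randn(2, 4, 8, 16)

pv_matmul = MatMul()
out_wrapped = pv_matmul(attn_weights, value_states)
out_direct = torch.matmul(attn_weights, value_states)

# Identical results in full precision
print(torch.equal(out_wrapped, out_direct))
```

So even if the outputs match bit-for-bit today, replacing `self.pv_matmul` with a bare `torch.matmul` would likely break the quantization hooks, assuming QLLM follows this wrapper pattern.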