k v compression in attention may causes small targets and detail to be lost？

Question

Opened this issue 9 months ago · 0 comments

In attention， to reduce the calculation amount, kv is compressed， small targets and detail will lose ？