KleidiAI and matrix multiplication
rwalkr opened this issue · 1 comments
rwalkr commented
The description of FMOPA on https://learn.arm.com/learning-paths/cross-platform/kleidiai-explainer/page1/:
FMOPA: This instruction stands for Floating-point outer product and accumulate. It is included in the Arm Scalable Vector Extension 2 (SVE2). The single precision FMOPA variant enables optimized matrix multiplication on 32-bit numbers. View the FMOPA documentation here .
incorrectly says it's part of SVE2 - it's SME
pareenaverma commented