Manually optimize the GEMM (GEneral Matrix Multiply) operation. There is a long way to go.
Primary LanguageC++