optimize-gemm: A Jupyter Notebook repository from axmat

Optimize single-threaded General Matrix Multiplication (GEMM) of two square matrices

Transpose the second matrix
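
As a rough illustration of why transposing the second matrix can help, here is a minimal sketch. It assumes row-major square matrices stored in a `std::vector<double>`. The names `Matrix`, `transpose`, `gemm_naive` and `gemm_transposed` are hypothetical and are not taken from this repository. With B transposed, the inner loop reads both operands contiguously instead of striding down a column of B.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical storage: an n x n matrix in row-major order.
using Matrix = std::vector<double>;

// Returns the transpose of a row-major n x n matrix.
Matrix transpose(const Matrix& m, std::size_t n) {
    assert(m.size() == n * n);
    Matrix t(n * n);
    for (std::size_t i = 0; i < n; ++i)
        for (std::size_t j = 0; j < n; ++j)
            t[j * n + i] = m[i * n + j];
    return t;
}

// Baseline C = A * B: the inner loop walks down a column of B,
// so consecutive reads of b are n elements apart.
Matrix gemm_naive(const Matrix& a, const Matrix& b, std::size_t n) {
    assert(a.size() == n * n && b.size() == n * n);
    Matrix c(n * n, 0.0);
    for (std::size_t i = 0; i < n; ++i)
        for (std::size_t j = 0; j < n; ++j) {
            double sum = 0.0;
            for (std::size_t k = 0; k < n; ++k)
                sum += a[i * n + k] * b[k * n + j];
            c[i * n + j] = sum;
        }
    return c;
}

// C = A * B computed via bt = B^T: row i of a and row j of bt
// are both read contiguously in the inner loop.
Matrix gemm_transposed(const Matrix& a, const Matrix& b, std::size_t n) {
    assert(a.size() == n * n && b.size() == n * n);
    const Matrix bt = transpose(b, n);
    Matrix c(n * n, 0.0);
    for (std::size_t i = 0; i < n; ++i)
        for (std::size_t j = 0; j < n; ++j) {
            double sum = 0.0;
            for (std::size_t k = 0; k < n; ++k)
                sum += a[i * n + k] * bt[j * n + k];
            c[i * n + j] = sum;
        }
    return c;
}
```

Both functions compute the same product. The transposed variant pays an extra O(n^2) pass to build B^T, which is small next to the O(n^3) multiplication for large matrices. Whether it is actually faster depends on the matrix size and the cache hierarchy, so it is worth measuring.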

mkdir build
cd build
cmake .. -DCMAKE_CXX_COMPILER=clang++
cmake --build . --

export OMP_NUM_THREADS=1
./bench-gemm [dim_size]