Fast AVX2/FMA3 DGEMM and SGEMM subroutines for medium to large matrices (> 2000*2000) on Haswell, Skylake, and Zen processors, with performance comparable to MKL.
Primary language: C. License: GNU General Public License v3.0 (GPL-3.0).
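For readers unfamiliar with the operation, DGEMM computes C = alpha*A*B + beta*C in double precision (SGEMM is the single-precision counterpart). The following is a minimal, unoptimized reference sketch of that operation, useful mainly as a correctness baseline when checking an optimized kernel. The function name `ref_dgemm`, its argument order, and the column-major layout are assumptions made for illustration; they are not taken from this repository's interface, and the sketch contains none of the AVX2/FMA3 optimizations the repository describes.

```c
#include <stddef.h>

/* Reference (unoptimized) DGEMM sketch: C = alpha*A*B + beta*C.
 * Assumptions (not this repository's API): column-major storage,
 * no transposition, A is m x k, B is k x n, C is m x n, and
 * lda/ldb/ldc are the leading dimensions (column strides).
 * Unlike standard BLAS, C is always read, even when beta == 0. */
void ref_dgemm(size_t m, size_t n, size_t k, double alpha,
               const double *A, size_t lda,
               const double *B, size_t ldb,
               double beta, double *C, size_t ldc)
{
    for (size_t j = 0; j < n; ++j) {
        /* Scale column j of C by beta. */
        for (size_t i = 0; i < m; ++i)
            C[i + j * ldc] *= beta;

        /* Accumulate alpha * A(:,p) * B(p,j) into column j of C. */
        for (size_t p = 0; p < k; ++p) {
            double b = alpha * B[p + j * ldb];
            for (size_t i = 0; i < m; ++i)
                C[i + j * ldc] += A[i + p * lda] * b;
        }
    }
}
```

The loop order (j, p, i) keeps the innermost accesses to A and C contiguous in column-major storage; an optimized implementation would typically add cache blocking and vectorized inner kernels on top of the same arithmetic.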