Issues
I can't use OpenMP in Nim 2.0, and DLL files such as libgomp-1 must be placed in the same folder as the built exe file to execute it.
#43 opened by kiyoken1594 - 4
performance of avx512 bit ops and popcounts
#41 opened by brentp - 1
Mysterious 2x perf regression on GEMM
#40 opened by mratsim - 6
parallel reduction
#36 opened by brentp - 0
[Lux] Multithreading for JIT code
#31 opened by mratsim - 0
NUMA-aware memory allocation and computation
#30 opened by mratsim - 1
Benchmark example using Intel MKL (for history)
#10 opened by Laurae2 - 3
Regression on GEMM allocation
#27 opened by mratsim - 0
System Profile Dual Xeon Gold 6154
#25 opened by Laurae2 - 1
performance of gemm_strided vs numpy
#23 opened by timotheecour - 1
gemm_strided: error: always_inline function '_mm256_setzero_pd' requires target feature 'xsave'
#22 opened by timotheecour - 1
[GEMM] Enhance serial implementation
#21 opened by mratsim - 0
Fused assignation shortcut
#18 opened by mratsim - 1
Fast image loading primitives
#17 opened by mratsim - 1
Matrix multiplication: Nested parallelism
#9 opened by mratsim - 3
Exponential: Dual Xeon Gold 6154 result
#11 opened by mratsim - 0
Transpose does not scale well with multithread
#13 opened by Laurae2 - 0
Create a benchmark script
#12 opened by mratsim - 1
Optimised random sampling methods
#8 opened by mratsim - 0
Update for devel OpenMP
#3 opened by mratsim - 0
[Design] Error model
#2 opened by mratsim - 0
Iteration code size comparison
#1 opened by mratsim