This project try to parallelize the reduction of an array with different approaches. The project is made for Windows.
- Sequential
- Auto-parallelize
- No vector
- OpenMP
- Threads (with futures)
- OpenCL
- CUDA
- SIMD (written but has a bug)
- Auto-parallelize from Microsoft does not work - error 1007: reduction of array to scalar - so it's normal.
In developer command line:
cl /EHsc /O2 /Qpar main.cpp && main
Dugagjin Lashi
MIT