kahypar/mt-kahypar

Does the parallel implementation use SIMD instructions?

zlwu92 opened this issue · 1 comment

Hi,

I think this is a great project. I'm wondering whether the framework uses SIMD instructions for parallelization, or whether it is a multithreaded CPU program (for example, in the deterministic version of mt-kahypar).

We're not using SIMD (beyond what the compiler auto-vectorizes); the parallelism comes from multithreading on the CPU. Our algorithms exhibit sparse, random memory access patterns to gather and aggregate data, which to my understanding are not well suited for SIMD. Additionally, there's no heavy arithmetic in performance-critical sections.
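To illustrate the kind of access pattern meant here, the following is a minimal sketch and not code from Mt-KaHyPar. The `Hypergraph` struct, its CSR-style layout, and the functions `sum_contiguous` and `pin_counts_in_blocks` are hypothetical names chosen for this example. The first loop streams over contiguous memory, which compilers commonly auto-vectorize. The second loop reads `part_of` and increments `counts` at data-dependent positions (a gather followed by a scatter), which is much harder to map onto SIMD lanes.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical CSR-style hypergraph (an assumption for this sketch, not
// Mt-KaHyPar's actual data structure): the pins of hyperedge e are
// pins[offsets[e]] .. pins[offsets[e + 1] - 1].
struct Hypergraph {
  std::vector<std::size_t> offsets;
  std::vector<std::uint32_t> pins;
};

// Contiguous, streaming access: consecutive iterations touch consecutive
// memory, so compilers can usually auto-vectorize this reduction.
std::int64_t sum_contiguous(const std::vector<std::int32_t>& weights) {
  std::int64_t total = 0;
  for (std::size_t i = 0; i < weights.size(); ++i) {
    total += weights[i];
  }
  return total;
}

// Counts how many pins of hyperedge e lie in each of the k blocks.
// part_of[v] is read at a data-dependent index (gather) and counts[b] is
// incremented at a data-dependent index (scatter). Two pins may hit the
// same counter, so the iterations are not independent across SIMD lanes.
std::vector<std::uint32_t> pin_counts_in_blocks(
    const Hypergraph& hg,
    const std::vector<std::uint32_t>& part_of,
    std::size_t e,
    std::uint32_t k) {
  assert(e + 1 < hg.offsets.size());
  std::vector<std::uint32_t> counts(k, 0);
  for (std::size_t i = hg.offsets[e]; i < hg.offsets[e + 1]; ++i) {
    const std::uint32_t v = hg.pins[i];   // sequential read of the pin list
    const std::uint32_t b = part_of[v];   // random read, depends on v
    assert(b < k);
    ++counts[b];                          // random write, depends on b
  }
  return counts;
}
```

In the second function, almost all of the work is memory traffic and there is only a single integer increment per pin, which matches the point that there is no heavy arithmetic for SIMD to speed up.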

We're always open to learning about more ways to improve performance in our code. If you have cool ideas for how to use SIMD in a partitioning code, we'd love to hear about them!

Please feel free to reopen the issue if I haven't adequately addressed your question.

Lars