Mellanox/libvma

Shared resource among vma clients

ps142020 opened this issue · 2 comments

I am running 12 instances of an application and comparing two setups: the 12 processes launched on 12 different cores versus all 12 launched on a single core. I expected latency to increase when all 12 processes run on a single core, for obvious reasons, but it turns out VMA performs much better in that case than in the 12-core case. What is the shared resource here? I am not even running vma. Also, in the mlx_fs_dump output I always see 12 queues. Is there any documentation on the details of the VMA architecture?

Any update on this?