Missing UCX RMA support for 8/16bit MPI atomics
lundgren87 opened this issue · 0 comments
The ArgoDSM atomics currently make use of atomic MPI operations of size 8, 16, 32 and 64 bits. The choice of MPI datatype is made through the below function or its unsigned or float equivalent.
argodsm/src/backend/mpi/mpi.cpp
Lines 64 to 94 in dc8d789
OpenMPI (and most definitely MPICH as it is bundled) nowdays pushes Infiniband users towards using UCX. However, UCX does not provide full RMA support for 8 and 16 bit atomics, instead falling back to active messaging for these (if supported at all).
UCX Documentation
Related issue
Some of the ArgoDSM backend tests (atomicXchgAll
, atomicXchgOne
) currently fail with "unsupported datatype" when forcing the selection of the UCX osc module or when other alternatives are disabled. For both performance (avoiding active communication) and compatibility reasons, perhaps it would be better to perform at least a properly aligned 32-bit atomic operation instead of 8/16?