Azure/msccl-executor-nccl

How to run custom algos in single process?

Opened this issue · 0 comments

Hi dear authors, I'd like to know how to run the custom algos in single process mode as mentioned in introduction.
MSCCL-EXECUTOR-NCCL supports an arbitrary number of GPUs installed in a single node or across multiple nodes, and can be used in either single- or multi-process (e.g., MPI) applications.