How to run custom algos in single process?
Opened this issue · 0 comments
TonyWu199 commented
Hi dear authors, I'd like to know how to run the custom algos in single process mode as mentioned in introduction.
MSCCL-EXECUTOR-NCCL supports an arbitrary number of GPUs installed in a single node or across multiple nodes, and can be used in either single- or multi-process (e.g., MPI) applications.