
A communication library for deep learning


MegRay

MegRay is a cross-platform communication library that provides point-to-point and collective communication methods, such as send, recv, all_gather, all_reduce, reduce_scatter, reduce, and broadcast. In deep learning, these methods can be used to implement distributed training frameworks, including data-parallel and model-parallel training. Currently there are two backends, NCCL and UCX, and only the CUDA platform is supported. Algorithms for more platforms will be added in the future.
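
To make the collective methods named above concrete, here is a minimal single-process sketch of what all_reduce, all_gather, and reduce_scatter compute. It does not use MegRay's API: the `sim_*` functions and the `Buffer` alias are hypothetical names introduced only for illustration, each "rank" is just one entry in a vector, and the reduction is assumed to be a sum. A real backend would move this data between devices.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical alias: one rank's data. Not a MegRay type.
using Buffer = std::vector<float>;

// all_reduce (sum assumed): every rank ends up with the element-wise
// sum of all ranks' inputs. Assumes all inputs have the same length.
std::vector<Buffer> sim_all_reduce(const std::vector<Buffer>& in) {
    Buffer sum(in[0].size(), 0.0f);
    for (const Buffer& b : in)
        for (std::size_t i = 0; i < b.size(); ++i) sum[i] += b[i];
    return std::vector<Buffer>(in.size(), sum);
}

// all_gather: every rank ends up with the concatenation of all
// ranks' inputs, in rank order.
std::vector<Buffer> sim_all_gather(const std::vector<Buffer>& in) {
    Buffer cat;
    for (const Buffer& b : in) cat.insert(cat.end(), b.begin(), b.end());
    return std::vector<Buffer>(in.size(), cat);
}

// reduce_scatter (sum assumed): rank r ends up with the r-th chunk of
// the element-wise sum. Assumes the input length is divisible by the
// number of ranks.
std::vector<Buffer> sim_reduce_scatter(const std::vector<Buffer>& in) {
    const std::size_t n = in.size();
    const std::size_t chunk = in[0].size() / n;
    const Buffer sum = sim_all_reduce(in)[0];
    std::vector<Buffer> out;
    for (std::size_t r = 0; r < n; ++r)
        out.emplace_back(sum.begin() + r * chunk,
                         sum.begin() + (r + 1) * chunk);
    return out;
}
```

In data-parallel training, for example, each rank would hold its local gradients as the input and use the all_reduce result as the summed gradient shared by all ranks.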

Build

  1. Prepare the third-party repositories.
./third_party/prepare.sh
  2. Create a build directory.
mkdir build
cd build
  3. Generate the build configuration with CMake.
cmake .. -DMEGRAY_TEST=ON
  4. Build the project.
make