MegRay is a cross-platform communication library providing point-to-point and collective communication methods, such as send, recv, all_gather, all_reduce, reduce_scatter, reduce and broadcast. In the area of deep learning, these methods can be utilized for implementing distributed training framework, including data parallel and model parallel. Currently there are two backends, nccl and ucx, and only cuda platform is supported. In the future, algorithms on more platforms will be added.
- prepare third party repositories.
./third_party/prepare.sh
- Make a directory for build.
mkdir build
cd build
- Generate build configurations by
CMake
.
cmake .. -DMEGRAY_TEST=ON
- Start to build
make