Pinned Repositories
AveMujicaChk
An upgraded version of MujicaChk, which supports fast checkpoints for multiple nodes
gdr_python
Based on gpu_direct_rdma_access. Aims to provide tools for gdr (GPUDirect RDMA) access from Python. More information in MujicaChk.
GPUdirect_rdma_Access
This repository is based on Mellanox/gpu_direct_rdma_access. Some errors in the code have been fixed, some methods have been optimized, and some features have been added.
Mujica
In-memory Checkpoint.
MujicaChk
Mujica Checkpoint: fast, low-overhead checkpointing and flash recovery for LLMs
NLPRead
This repository was created as a repository for my NLP learning
Sonnet
A copy of the DeepSpeed-0.13.0-Sonnet version
Vertin
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Ind1x1's Repositories
Ind1x1/GPUdirect_rdma_Access
This repository is based on Mellanox/gpu_direct_rdma_access. Some errors in the code have been fixed, some methods have been optimized, and some features have been added.
Ind1x1/AveMujicaChk
An upgraded version of MujicaChk, which supports fast checkpoints for multiple nodes
Ind1x1/gdr_python
Based on gpu_direct_rdma_access. Aims to provide tools for gdr (GPUDirect RDMA) access from Python. More information in MujicaChk.
Ind1x1/Mujica
In-memory Checkpoint.
Ind1x1/MujicaChk
Mujica Checkpoint: fast, low-overhead checkpointing and flash recovery for LLMs
Ind1x1/NLPRead
This repository was created as a repository for my NLP learning
Ind1x1/Sonnet
A copy of the DeepSpeed-0.13.0-Sonnet version
Ind1x1/Vertin