Pinned Repositories
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
skydoorkai's Repositories
skydoorkai/caffe
Caffe: a fast open framework for deep learning.
skydoorkai/CuMF
CUDA-Accelerated ALS on multiple GPUs
skydoorkai/dlrover
DLRover: An Automatic Distributed Deep Learning System
skydoorkai/elasticdl.github.io
skydoorkai/ftlib
Fault tolerance for DL frameworks
skydoorkai/tests_and_issues
skydoorkai/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
skydoorkai/util-tf
skydoorkai/word2vec
Automatically exported from code.google.com/p/word2vec