R0n12
CSE PhD working on High Performance Deep Learning at OSU NOWLAB
The Ohio State UniversityColumbus, OH
Pinned Repositories
horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
EDSR-PyTorch-Horovod
PyTorch version of the paper 'Enhanced Deep Residual Networks for Single Image Super-Resolution' (CVPRW 2017)
gpt-neox-fork
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Leetcode-notes
A repo to include notes I wrote about Leetcode problems
Megatron-DeepSpeed-fork
Ongoing research training transformer language models at scale, including: BERT & GPT-2
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
pytorch-cifar100
Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, NasNet, Residual Attention Network, SENet, WideResNet)
Quandary-Public
fork of CSE6341 Project Quandary-Public
R0n12's Repositories
R0n12/EDSR-PyTorch-Horovod
PyTorch version of the paper 'Enhanced Deep Residual Networks for Single Image Super-Resolution' (CVPRW 2017)
R0n12/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
R0n12/gpt-neox-fork
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
R0n12/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
R0n12/Leetcode-notes
A repo to include notes I wrote about Leetcode problems
R0n12/Megatron-DeepSpeed-fork
Ongoing research training transformer language models at scale, including: BERT & GPT-2
R0n12/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
R0n12/pytorch-cifar100
Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, NasNet, Residual Attention Network, SENet, WideResNet)
R0n12/Quandary-Public
fork of CSE6341 Project Quandary-Public
R0n12/TTSR
[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution
R0n12/rocm-from-source
Building rocm from source