horovod
There are 52 repositories under the horovod topic.
tony-framework/TonY
TonY is a framework to natively run deep learning frameworks on Apache Hadoop.
kubeflow/mpi-operator
Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)
tensorlayer/awesome-tensorlayer
A curated list of dedicated resources and applications
jzlianglu/pykaldi2
Yet another speech toolkit based on Kaldi and PyTorch
Photon-AI-Research/NeuralSolvers
Neural network based solvers for partial differential equations and inverse problems. Implementation of physics-informed neural networks in PyTorch.
open-ce/open-ce
This repository provides the Open-CE environment files and version definitions for each Open-CE release.
richardkxu/distributed-pytorch
Distributed, mixed-precision training with PyTorch
polyaxon/polyaxon-examples
Code for tutorials and examples
saforem2/l2hmc-qcd
Application of the L2HMC algorithm to simulations in lattice QCD.
ShomyLiu/torch-ddp-examples
A text classification example using DDP, Horovod, and Accelerate
heyfey/vodascheduler
GPU scheduler for elastic/distributed deep learning workloads in Kubernetes clusters (IC2E'23)
Qznan/QizNLP
A TensorFlow framework for quickly running NLP tasks such as classification, sequence labeling, matching, and generation (Chinese NLP, with distributed training support)
ankurhanda/tf-unet
TensorFlow version of U-Net
graykode/horovod-ansible
Create Horovod cluster easily using Ansible
eth-cscs/tensorflow-training
Multi-GPU training with TensorFlow on Piz Daint
Himscipy/bnn_hvd
Distributed Training of Bayesian Neural Networks at Scale
aws-samples/sagemaker-distributed-training-digital-pathology-images
Distributed training of digital pathology tissue slide images using SageMaker and Horovod.
Shenggan/DeepCell-Keras
Reimplement Deep Cell with Keras and Horovod.
nemoramo/acoustic_model
A sub-repository, still under construction, for building an acoustic model for Mandarin speech recognition.
asprenger/distributed-training-patterns
Experiments with low level communication patterns that are useful for distributed training.
GiancarloPaoletti/PBS_qsub_gridsearch
Simple bash script to launch gridsearch qsub jobs on PBS
manoharpalanisamy/Distributed-Deep-Learning-With-Horovod-MPI
Distributed training framework for TensorFlow, Keras
oekosheri/pytorch_unet_scaling
Scaling U-Net in PyTorch
Smarker/batchai-benchmark
Distributed training with Batch AI
afogarty85/petastorm
Deep learning at scale
chrisabbott/distributed-unet
Segmenting EM-shower particles and track particles using U-Net and Horovod
dellemc-hpc-ai/ai-radiologist-GPU
GPU Optimized version of AI Radiologist
explcre/SHUKUN-Technology-AlgorithmIntern-MultiNodeTraining-for-DLmodels-Horovod-ConfigurationTutorial-Perf
SHUKUN Technology Co., Ltd. algorithm intern (2020/12-2021/5). Multi-GPU, multi-node training for deep learning models. Horovod, NVIDIA Clara Train SDK, configuration tutorial, performance testing.
MuhammadShifa/Anamoly-Detection-Model-ADM-in-Health-Care-Insurance-
Horovod is a distributed deep learning training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. The goal of Horovod is to make distributed deep learning fast and easy to use.
Nishant-codex/rnn_flip_flops
This repository contains code for RNNs trained on the 3-bit flip-flop task
oekosheri/tensorflow_unet_scaling
Scaling U-Net in TensorFlow
pnnl/ProxyTSPRD
Proxy application for analyzing dynamical systems.
veritas9872/Horovod-Pytorch-Tutorial
Horovod tutorial for PyTorch using NVIDIA-Docker.
WOGRA-AG/docker-ludwig-ray-gpu-jupyter
Makes the official ludwigai/ludwig-ray-gpu image available for JupyterHub.
Yujaeseo/NGCF_Pytorch
NGCF (Neural Graph Collaborative Filtering) PyTorch & Horovod implementation
bsraya/schedulearn
Training Deep Learning models made easy