/GDTrainingFramework

General distributed training framework based on Pytorch

Primary LanguagePython

General Distributed Training Framework

This is a general distributed deeplearning training framework based on Pytorch 1.0. You only need to add your model and loss function for a specific task.

Highlights

  • Distributed & Single GPU Flexible selection between distributed training with multi gpus and a single gpu.
  • Flexible Visulization Implementation You can implement any visulization function in "your_model.py" for network comprehensive analysis.
  • Suport Various Optimizers and Learning-rate Policy Provide all the optimizers and learning-rate schedulers in pytorch.
  • Mixed Precision Training Support mixed precision training with NVIDIA apex lib.
  • Sync BN Support Sync BN when training with multi gps.
  • Break-point Restoration Support continue to train from a break-point.