yashkant/padam-tensorflow
Reproducing the paper "PADAM: Closing The Generalization Gap of Adaptive Gradient Methods In Training Deep Neural Networks" for the ICLR 2019 Reproducibility Challenge
Python
Issues
- 0
high memory usage for Padam optimizer
#10 opened by leo-smi