The generic framework for stochastic gradient (SG) based algorithms.
Momentum (EMA of gradient) mechanism + SPGD = SPGD with momentum.
Adaptive learning rate method + SPGD = SPGD with adaptive learning rate.
The convergence speed of SPGD with momentum (and SPGD with adaptive learning rate) is faster than the original one.