Implementation of paper "Large Batch Training Does Not Need Warmup" paper