The key takeaways from this section include:
- Validation and test sets are used to evaluate model performance while iteratively building deep neural networks
- As with traditional machine learning models, we need to watch out for the bias-variance trade-off when building deep learning models
- Alternatives to plain gradient descent include Gradient Descent with Momentum, RMSprop, and Adam (see the optimizer sketch after this list)
- Hyperparameter tuning is crucially important when working with deep learning models, as choosing the right hyperparameter values can lead to large improvements in model performance (see the learning-rate search sketch below)
- Several regularization techniques can help us limit overfitting: L1 regularization, L2 regularization, dropout, etc. (see the regularization sketch below)
- Training of deep neural networks can be sped up by normalizing the inputs (see the normalization sketch below)
- Normalized inputs also help mitigate the common issue of vanishing or exploding gradients
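
As a quick illustration of the optimizer alternatives mentioned above, here is a minimal Keras sketch; the tiny architecture, input size, and learning rates are assumptions chosen only for demonstration, not values from this section.

```python
import tensorflow as tf

# Small illustrative binary classifier (architecture and sizes are placeholders).
model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation="relu", input_shape=(20,)),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])

# Alternatives to plain gradient descent:
sgd_momentum = tf.keras.optimizers.SGD(learning_rate=0.01, momentum=0.9)  # Gradient Descent with Momentum
rmsprop = tf.keras.optimizers.RMSprop(learning_rate=0.001)                # RMSprop
adam = tf.keras.optimizers.Adam(learning_rate=0.001)                      # Adam

# Swap in any of the optimizers defined above; Adam is a common default.
model.compile(optimizer=adam, loss="binary_crossentropy", metrics=["accuracy"])
```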
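
For hyperparameter tuning, one common approach is to sample values at random on a log scale, since useful learning rates span several orders of magnitude. The sketch below assumes a search range of 1e-4 to 1e-1 and a hypothetical `train_and_evaluate` helper; both are illustrative, not taken from the original text.

```python
import numpy as np

rng = np.random.default_rng(0)

# Sample candidate learning rates uniformly on a log scale (range is an assumption).
exponents = rng.uniform(low=-4, high=-1, size=10)
learning_rates = 10.0 ** exponents

for lr in learning_rates:
    # train_and_evaluate is a hypothetical helper that would train a model
    # with this learning rate and return its validation error:
    # val_error = train_and_evaluate(learning_rate=lr)
    print(f"candidate learning rate: {lr:.5f}")
```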
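
The regularization techniques listed above can be sketched in Keras as follows; the penalty strengths (0.01), dropout rate (0.5), and layer sizes are illustrative assumptions.

```python
import tensorflow as tf
from tensorflow.keras import layers, regularizers

# A small network combining the regularization techniques mentioned above.
model = tf.keras.Sequential([
    layers.Dense(64, activation="relu", input_shape=(20,),
                 kernel_regularizer=regularizers.l2(0.01)),  # L2 regularization
    layers.Dropout(0.5),                                     # dropout regularization
    layers.Dense(64, activation="relu",
                 kernel_regularizer=regularizers.l1(0.01)),  # L1 regularization
    layers.Dense(1, activation="sigmoid"),
])
```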
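
Finally, input normalization typically means zero-centering each feature and scaling it by its standard deviation, using statistics computed on the training set only. Here is a minimal NumPy sketch, with random data standing in for real features.

```python
import numpy as np

def normalize_inputs(X_train, X_test, eps=1e-8):
    """Zero-center and scale features using training-set statistics only."""
    mu = X_train.mean(axis=0)
    sigma = X_train.std(axis=0)
    # Reuse the training-set mu/sigma for the test set to avoid information leakage.
    return (X_train - mu) / (sigma + eps), (X_test - mu) / (sigma + eps)

# Illustrative usage with random data standing in for real features.
rng = np.random.default_rng(0)
X_train = rng.normal(loc=5.0, scale=3.0, size=(100, 4))
X_test = rng.normal(loc=5.0, scale=3.0, size=(20, 4))
X_train, X_test = normalize_inputs(X_train, X_test)
```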