Implemented Dropout, Batch Normalization and etc.
- Add visualization tests
- Cli to experiment to hyperparameters
- Gradient Checking
- Fix Dropout (Make it into a layer)
- Normalize Weights every iteration [Experimental]
- Early stop
- Sigmoid Layer
- Softmax Layer