An MLP class implemented from scratch, which is very close to what TensorFlow and PyTorch use. It supports ReLU, Sigmoid, Softmax, and Linear activation functions, and CrossEntropy and BinaryCrossEntropy losses.
It reached 97.3% accuracy on the MNIST dataset.
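
For readers unfamiliar with the building blocks listed above, here is a minimal sketch of the four activation functions, the two losses, and a single dense-layer forward pass. It is not taken from this repository: the function names (`relu`, `sigmoid`, `softmax`, `linear`, `cross_entropy`, `binary_cross_entropy`, `dense`) are hypothetical, and it uses plain Python lists on one sample at a time to stay dependency-free, whereas a practical implementation would normally be vectorized over batches.

```python
import math


def relu(z):
    """ReLU: max(0, v) for each element."""
    return [max(0.0, v) for v in z]


def _sigmoid_scalar(v):
    # Branch on the sign so math.exp never receives a large positive argument.
    if v >= 0:
        return 1.0 / (1.0 + math.exp(-v))
    e = math.exp(v)
    return e / (1.0 + e)


def sigmoid(z):
    """Logistic sigmoid for each element."""
    return [_sigmoid_scalar(v) for v in z]


def softmax(z):
    """Softmax over a vector; subtracting the max avoids overflow."""
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


def linear(z):
    """Identity activation."""
    return list(z)


def cross_entropy(probs, target_index, eps=1e-12):
    """Cross-entropy for one sample, given class probabilities and the true class index."""
    return -math.log(max(probs[target_index], eps))


def binary_cross_entropy(p, y, eps=1e-12):
    """Binary cross-entropy for one predicted probability p and a label y in {0, 1}."""
    p = min(max(p, eps), 1.0 - eps)  # clip to keep log() finite
    return -(y * math.log(p) + (1 - y) * math.log(1.0 - p))


def dense(x, weights, biases, activation):
    """Forward pass of one fully connected layer: activation(W x + b).

    `weights` holds one row of input weights per output unit.
    """
    z = [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weights, biases)]
    return activation(z)


# Tiny two-layer forward pass with hand-picked (illustrative) weights.
hidden = dense([1.0, -2.0], [[0.5, 0.5], [1.0, -1.0]], [0.0, 0.0], relu)
probs = dense(hidden, [[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0], softmax)
loss = cross_entropy(probs, 1)
```

Softmax is typically paired with cross-entropy for multi-class outputs such as the ten MNIST digits, while sigmoid with binary cross-entropy suits two-class problems.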