/Neural_Network_From_Scratch

This is the code of a Neural Network built from scratch without any libraries other than NumPy to deal with matrices and to handle the Linear Algebra part.

Primary LanguagePythonMIT LicenseMIT

Simple Neural Network From Scratch

In this project, I break down the basic idea of Neural Networks without any libraries -other than NumPy- just simple to understand how the NN works in the background. I used NumPy to handle the Linear Algebra part, dealing with matrices, multiplying them, and so on.


Logo

Tech Stack

  • Python: Version 3.10

  • NumPy: Version 1.23.0

  • Spyder IDE: Version 5.3.2

Details

  • After I watched a Youtube course about Neural Networks and when I came to the part to apply what I have learned, I got stuck with that! I was not able to code a simple NN by myself, I was just copying the code with no understanding of how it works. I searched here and there until I got the point. The idea is not difficult so I recommend you to see the References below for a better understanding.

  • You can Imagine this problem as a Student Affairs Specialist, we have 6 students with their grades in 3 subjects (e.g. Math, Physics, Music, Arabic Langauge... Chiemsrty). if the student passed specific subjects, will he pass to next year or he will get failed and repeat the year? We assume that 1 = passed the subjects and 0 = Did not pass the subject. Also, in the results, 1 = passed the year successfully, and 0 = failed the year (Did not pass it). So, our goal is to determine if a student with X grades in Y subjects will pass the year. This is our problem.

  • The subjects are called Features which are the columns, and the rows which are how many students we have it is 8 in this example, and of course, you can add or delete some data. The backpropagation equation is written in the video in the References check it.

  • My Neural Network is supposed to solve a Binary Classification Problem, I created the dataset as an example to focus more on the core idea. However, you can add your dataset and it should work fine! I made the NN with 3 Layers(Not including the Input Layer), the first layer has 9 Neurons and the second layer has 5 Neurons and The last Layer(Output Layer) has 1 Neuron. In addition, I used the Sigmoid function as an Activation Function. (Please See figure 1 below to understand the Diagram of the NN).

  • Why did I choose these specific numbers of layers and neurons? I do not know! Till now, I still do not know how to set these Parameters in a reasonable way not just choosing them without any sense! I keep searching about that and once I understand itو I shall update this file with the clarification.

Update Version V2.0

  • Well... I have found many bugs and incorrect concepts I have applied here. I will list them here so you can avoid them.


  • The first mistake, I did not know that for a Binary Classification Problem it's recommended to use a Leaky Relu (Or Relu) as an Activation Function in the hidden layers while in the Output layer I have to use the Sigmoid Function. So, simply I just applied these concepts but doing this Only did not solve the problem.

  • Secondly, I applied the derivatives of the Activation Function improperly :( Thus, the expected outputs did not make any sense. When I give it a look again, solved it by hand. So, it became more reasonable.

  • Thirdly, I did not multiply the weights and biases by the Factor alpha α or eta η (Learning rate) So it leads to NaN Values or Constant values(i.e. The same output for all instances)

  • Fourth, I used biases for every neuron for every sample- which is not correct indeed. the right thing is to make a bias for every neuron. Not for every example per every neuron. I used np.sum() to sum the whole row per training examples so finally I will have a vector of biases n*1 where n is number is the Neurons number

  • Finally, I used a very simple dataset I just created as I have also changed the neurons number (but the figures still working tho) just to test the algorithm. However, when I tried Haberman's Survival Data Set it did not work. I do not know why, so, I think this might need some PreProcessing Data before we could start training. I do not know how to do that yet, but I will keep this updated if I managed to do it. You will find the dataset filtered and sorted randomly in the Reposoitry


Figures

Figure 1


Figure 2

Contributing

Contributions are what makes the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Do not forget to give the project a star! Thanks again!


License

Distributed under the MIT License. See LICENSE.txt for more information.

References

Contacts