Building the foundations of deep learning, from matrix multiplication and backpropagation to ResNets and beyond
- Matrix Multiplication
- Neural Network Forward Pass
- Neural Network Backpropagation
- Rebuilding PyTorch Essentials
Optimizing matrix multiplication from scratch:
- Nested loops in standard Python
- Array Slicing
- Array Broadcasting
- Einstein Summation in PyTorch
- Standard PyTorch
Matrix multiplication in standard PyTorch is about 44,000 times faster than the nested-loop implementation in plain Python.
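A minimal sketch of these variants is shown below; the function names are illustrative, and the single-run timings are only indicative, since exact speedups depend on matrix sizes and hardware.

```python
import time
import torch

def matmul_loops(a, b):
    # Naive triple loop in plain Python: one scalar multiply-add at a time.
    n, m = a.shape
    _, p = b.shape
    c = torch.zeros(n, p)
    for i in range(n):
        for j in range(p):
            for k in range(m):
                c[i, j] += a[i, k] * b[k, j]
    return c

def matmul_broadcast(a, b):
    # One row at a time: broadcast a[i] against all of b, sum over the shared dim.
    c = torch.zeros(a.shape[0], b.shape[1])
    for i in range(a.shape[0]):
        c[i] = (a[i].unsqueeze(-1) * b).sum(dim=0)
    return c

def matmul_einsum(a, b):
    # Einstein summation: contract over the shared index k.
    return torch.einsum('ik,kj->ij', a, b)

a, b = torch.randn(64, 32), torch.randn(32, 16)
for name, f in [('loops', matmul_loops), ('broadcast', matmul_broadcast),
                ('einsum', matmul_einsum), ('torch.matmul', torch.matmul)]:
    t0 = time.perf_counter()
    out = f(a, b)
    print(f'{name}: {time.perf_counter() - t0:.6f}s')
    assert torch.allclose(out, a @ b, atol=1e-4)
```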
Demonstrating the difficulty in training neural networks:
- Exploding Activations with added depth [Solution: Xavier Initialization]
- Vanishing Activations when using ReLU [Solution: Kaiming Initialization]
- Improvements with Parametric/Leaky/Shifted ReLU
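A minimal sketch of both failure modes and their fixes, assuming a stack of plain linear layers; the depth, width, and batch size are arbitrary, and the scaling factors are the simplified Xavier (variance 1/fan_in) and Kaiming (variance 2/fan_in) rules:

```python
import torch

def final_activation_std(depth=50, width=256, init=None, relu=False):
    # Push random data through `depth` linear layers and report the
    # standard deviation of the final activations.
    x = torch.randn(512, width)
    for _ in range(depth):
        w = torch.randn(width, width)
        if init == 'xavier':
            w *= (1 / width) ** 0.5      # Xavier: Var(w) = 1 / fan_in
        elif init == 'kaiming':
            w *= (2 / width) ** 0.5      # Kaiming: Var(w) = 2 / fan_in, for ReLU
        x = x @ w
        if relu:
            x = x.clamp(min=0)           # ReLU
    return x.std()

print(final_activation_std())                          # no scaling: explodes to inf/nan
print(final_activation_std(init='xavier'))              # stays near 1 without ReLU
print(final_activation_std(init='xavier', relu=True))   # shrinks toward 0: vanishing
print(final_activation_std(init='kaiming', relu=True))  # roughly stable with ReLU
```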
After developing an appreciation of the challenges in training neural networks, we build a Feed Forward Neural Network that mimics PyTorch's modular design.
Implementing Autograd (Automatic Differentiation) functionality for:
- Linear layer: Affine function
- Activation layer: ReLU
- Loss layer: Mean Squared Error
We design a layer abstraction class to build a Fully Connected Neural Network capable of backpropagating errors using automatic differentiation of its computation graph. PyTorch's design choices, such as nn.Module, start to make perfect sense.
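One way to sketch that abstraction, assuming each layer caches its input in the forward pass and stores gradients in a `.g` attribute during the backward pass; the class names `Lin`, `Relu`, `Mse`, and `Model` are illustrative, not PyTorch's:

```python
import torch

class Lin:
    # Affine layer: out = x @ w + b
    def __init__(self, n_in, n_out):
        self.w = torch.randn(n_in, n_out) * (2 / n_in) ** 0.5  # Kaiming init
        self.b = torch.zeros(n_out)

    def __call__(self, x):
        self.x = x                        # cache input for the backward pass
        self.out = x @ self.w + self.b
        return self.out

    def backward(self):
        self.x.g = self.out.g @ self.w.t()
        self.w.g = self.x.t() @ self.out.g
        self.b.g = self.out.g.sum(0)

class Relu:
    def __call__(self, x):
        self.x = x
        self.out = x.clamp(min=0)
        return self.out

    def backward(self):
        self.x.g = (self.x > 0).float() * self.out.g

class Mse:
    def __call__(self, pred, targ):
        self.pred, self.targ = pred, targ
        return ((pred - targ) ** 2).mean()

    def backward(self):
        self.pred.g = 2 * (self.pred - self.targ) / self.pred.numel()

class Model:
    # Chains the layers, then backpropagates through them in reverse order.
    def __init__(self, n_in, n_hidden, n_out):
        self.layers = [Lin(n_in, n_hidden), Relu(), Lin(n_hidden, n_out)]
        self.loss = Mse()

    def __call__(self, x, targ):
        for layer in self.layers:
            x = layer(x)
        return self.loss(x, targ)

    def backward(self):
        self.loss.backward()
        for layer in reversed(self.layers):
            layer.backward()

x, y = torch.randn(100, 10), torch.randn(100, 1)
model = Model(10, 32, 1)
loss = model(x, y)
model.backward()          # every Lin layer now holds w.g and b.g
```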
We explore the internal abstractions and architecture of PyTorch in depth and rebuild them from scratch:
- PyTorch Data Abstractions
- Dataset
- DataLoader
- DataSampler
- PyTorch Training Abstractions
- nn.Parameter
- nn.Sequential
- Optimizer
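A minimal sketch of a few of these rebuilt abstractions, assuming the model itself is an ordinary autograd-capable PyTorch module; the class names mirror the roles above but the implementations are deliberately simplified:

```python
import random
import torch

class Dataset:
    # Pairs inputs with targets and exposes length and indexing.
    def __init__(self, x, y):
        self.x, self.y = x, y
    def __len__(self):
        return len(self.x)
    def __getitem__(self, i):
        return self.x[i], self.y[i]

class Sampler:
    # Yields dataset indices, optionally shuffled each epoch.
    def __init__(self, ds, shuffle=False):
        self.n, self.shuffle = len(ds), shuffle
    def __iter__(self):
        idxs = list(range(self.n))
        if self.shuffle:
            random.shuffle(idxs)
        return iter(idxs)

class DataLoader:
    # Groups sampled indices into batches and collates them into tensors.
    def __init__(self, ds, sampler, bs):
        self.ds, self.sampler, self.bs = ds, sampler, bs
    def __iter__(self):
        batch = []
        for i in self.sampler:
            batch.append(i)
            if len(batch) == self.bs:
                yield self.collate(batch)
                batch = []
        if batch:
            yield self.collate(batch)
    def collate(self, idxs):
        xs, ys = zip(*[self.ds[i] for i in idxs])
        return torch.stack(xs), torch.stack(ys)

class SGD:
    # Bare-bones optimizer: step() applies the gradients, zero_grad() clears them.
    def __init__(self, params, lr):
        self.params, self.lr = list(params), lr
    def step(self):
        with torch.no_grad():
            for p in self.params:
                p -= self.lr * p.grad
    def zero_grad(self):
        for p in self.params:
            p.grad = None

ds = Dataset(torch.randn(256, 10), torch.randn(256, 1))
dl = DataLoader(ds, Sampler(ds, shuffle=True), bs=32)
model = torch.nn.Sequential(torch.nn.Linear(10, 32), torch.nn.ReLU(), torch.nn.Linear(32, 1))
opt = SGD(model.parameters(), lr=0.1)
for xb, yb in dl:
    loss = torch.nn.functional.mse_loss(model(xb), yb)
    loss.backward()
    opt.step()
    opt.zero_grad()
```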
Having dived deep into the inner workings of PyTorch, we gain a deeper understanding of deep learning concepts, the problems that arise, and the existing solutions. We also gain insight into the software architecture, design, and development process of a popular deep learning framework.