/MicrogradPlus

MicrogradPlus is an educational project aiming to provide a simple, yet extensible, NumPy-based automatic differentiation library.

Primary LanguagePythonMIT LicenseMIT

MicrogradPlus

Introduction

MicrogradPlus is an educational project aiming to provide a simple, yet extensible, NumPy-based automatic differentiation library. It is a modified version of the original Micrograd engine, with added functionalities for exploring the basics of deep learning frameworks. The goal of Micrograd_Plus is to extend Andrej Karpathy's micrograd implementation to provide a basis for vectorized auto gradients. The original Micrograd was built as a scalar-value gradient tool-kit and educational framework. This project aims to extend this functionality to Matricies and Vectors with the intention of introducing the Operations required to do so for calculating gradients in an auto-diff library.

This project was inspired by Micrograd and Tinygrad in hopes of expanding the gradient toolkit in an intermediate fashion. Tinygrad was inspired as a succesor to Micrograd, however does not provide a simplistic oversight in terms of demonstrating how the transition between scalar and vectorized auto gradients should be handled. This project aims to fill in the educational gaps between the two implementations. The idea of Tinygrad is to use ShapeTracker, where the shapes determine the gradients of the operands. In this implementation, it is clear where the shape tracker would be useful and how this can play into providing an optimized framework.

If you liked this project, feel free to message me about it on Twitter or check out my website for other avenues of connection.

Feel free to add any additional operations which I may have missed in this project, a pull request is all that is required!

Features

  • Scalar and matrix operations
  • Backpropagation for automatic gradient calculation
  • Functions for basic operations like addition, multiplication, division, and matrix multiplication
  • Support for more advanced operations like sigmoid, logarithms, and power
  • Element-wise operations and broadcasting

Usage

Here is a simple example demonstrating the use of the library:

from micrograd_plus import NP_Value

# Create two scalar values
a = NP_Value(2.0)
b = NP_Value(3.0)

# Perform operations
c = a + b
d = a * c

# Compute gradients
d.backward()

print(a.grad)  # Output should be the derivative of d with respect to a

Core Components

NP_Value

The NP_Value class is the heart of the library. Each NP_Value instance represents a differentiable scalar or tensor value.

Operations Supported:

  • __add__, __sub__, __mul__, __div__, __matmul__: Basic arithmetic operations
  • sigmoid: Sigmoid activation function
  • log: Logarithm
  • __pow__: Power
  • sum: Summation over a specific axis
  • unsqueeze: Adds an extra dimension to the data
  • squeeze: Removes dimensions of size 1 from the data

Properties:

  • data: The actual data (NumPy array)
  • grad: The gradient, initially set to zero

Methods:

  • backward(): Computes the gradient with respect to all its ancestors in the computation graph

Example:

a = NP_Value([1.0, 2.0])
b = NP_Value([3.0, 4.0])
c = a + b

c.backward()

After the backward pass, gradients will be populated in a.grad and b.grad.

Caveats and Limitations

  • The current implementation does not handle all corner cases (e.g., higher-order gradients).
  • Broadcasting is supported only in limited cases.

Contribution

Feel free to contribute to the library by opening issues or pull requests.

License

MIT License. See the LICENSE file for more details.