Pure Python implementation of reverse-mode automatic differentiation

Last update: Sep 12, 2022

MiniGrad

A minimal implementation of reverse-mode automatic differentiation (a.k.a. autograd / backpropagation) in pure Python.

Inspired by Andrej Karpathy's micrograd, but with more comments and less cleverness. Thanks for the wonderful reference implementation and tests!

Overview

Create a Scalar.

a = Scalar(1.5)

Do some calculations.

b = Scalar(-4.0)
c = a**3 / 5
d = c + (b**2).relu()

Compute the gradients.

d.backward()
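
The expected gradients for this example can be derived by hand. Since d = a**3 / 5 + relu(b**2), we get ∂d/∂a = 3a²/5 = 1.35 and ∂d/∂b = 2b = -8.0 (the ReLU acts as the identity here because b² = 16 > 0). The sketch below cross-checks those values with central finite differences on plain floats. It does not use MiniGrad at all, and the helper names relu, f, and numeric_grad are hypothetical.

```python
def relu(x):
    # ReLU on a plain float.
    return x if x > 0 else 0.0


def f(a, b):
    # Same expression as the example above: d = a**3 / 5 + relu(b**2)
    return a**3 / 5 + relu(b**2)


def numeric_grad(func, args, i, eps=1e-6):
    # Central finite difference with respect to the i-th argument.
    hi = list(args)
    lo = list(args)
    hi[i] += eps
    lo[i] -= eps
    return (func(*hi) - func(*lo)) / (2 * eps)


a, b = 1.5, -4.0
dd_da = numeric_grad(f, (a, b), 0)  # approximately 3 * a**2 / 5
dd_db = numeric_grad(f, (a, b), 1)  # approximately 2 * b
print(dd_da, dd_db)
```

If the autograd pass is correct, the gradients stored on a and b after d.backward() should agree with these numbers up to floating-point error.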

Plot the computational graph.

draw_graph(d)

Repo Structure

demo.ipynb: Demo notebook of MiniGrad's functionality.
tests.ipynb: Test notebook to verify gradients against PyTorch and JAX. Install both to run tests.
minigrad/minigrad.py: The entire autograd logic in one (~100 loc) numeric class. See section below for details.
minigrad/visualize.py: This just draws nice-looking computational graphs. Install Graphviz to run it.
requirements.txt: MiniGrad requires no external modules to run. This file just sets up my dev environment.

Implementation

MiniGrad is implemented in one small (~100 loc) Python class, using no external modules.

The entirety of the auto-differentiation logic lives in the Scalar class in minigrad.py.

A Scalar wraps a float/int and overrides its arithmetic magic methods in order to:

Stitch together a define-by-run computational graph when doing arithmetic operations on a Scalar
Hard code the derivative functions of arithmetic operations
Keep track of ∂self/∂parent between adjacent nodes
Compute ∂output/∂self with the chain rule on demand (when .backward() is called)
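
The four responsibilities above can be illustrated with a minimal sketch. This is not MiniGrad's actual code: the class name TinyScalar, its attribute names (value, parents), and the restriction of ** to constant exponents are assumptions made purely for illustration.

```python
class TinyScalar:
    """Illustrative stand-in for a Scalar-like class; not MiniGrad's real code."""

    def __init__(self, value, parents=()):
        self.value = float(value)
        self.grad = 0.0
        # Pairs of (parent node, d(self)/d(parent)), recorded when the node is created.
        self.parents = tuple(parents)

    @staticmethod
    def _wrap(x):
        return x if isinstance(x, TinyScalar) else TinyScalar(x)

    def __add__(self, other):
        other = self._wrap(other)
        return TinyScalar(self.value + other.value, [(self, 1.0), (other, 1.0)])

    __radd__ = __add__

    def __mul__(self, other):
        other = self._wrap(other)
        return TinyScalar(self.value * other.value,
                          [(self, other.value), (other, self.value)])

    __rmul__ = __mul__

    def __pow__(self, exponent):
        # Only constant (int/float) exponents are handled in this sketch.
        return TinyScalar(self.value ** exponent,
                          [(self, exponent * self.value ** (exponent - 1))])

    def __truediv__(self, other):
        return self * self._wrap(other) ** -1

    def relu(self):
        return TinyScalar(max(self.value, 0.0),
                          [(self, 1.0 if self.value > 0 else 0.0)])

    def backward(self):
        # Build a post-order listing of the graph; reversing it yields an order
        # in which every node comes before all of its parents.
        order, seen = [], set()

        def visit(node):
            if id(node) not in seen:
                seen.add(id(node))
                for parent, _ in node.parents:
                    visit(parent)
                order.append(node)

        visit(self)
        self.grad = 1.0  # d(output)/d(output)
        for node in reversed(order):
            for parent, local in node.parents:
                # Chain rule: accumulate d(output)/d(parent).
                parent.grad += node.grad * local


a = TinyScalar(1.5)
b = TinyScalar(-4.0)
d = a**3 / 5 + (b**2).relu()
d.backward()
print(a.grad, b.grad)
```

Each operator both computes the forward value and stores the local derivative next to the parent it came from, so backward() only has to walk the recorded graph once and multiply along the edges.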

This is called reverse-mode automatic differentiation. It's great when you have few outputs and many inputs, since it computes all derivatives of one output in one pass. This is also how TensorFlow and PyTorch normally compute gradients.

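(Forward-mode automatic differentiation also exists, and has the opposite advantage: it suits functions with few inputs and many outputs, since it computes the derivatives of all outputs with respect to one input in one pass.)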
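
For contrast, forward mode can be sketched with dual numbers, which carry a derivative alongside each value during the forward pass. This is a generic illustration and not part of MiniGrad; the Dual class, its attributes, and the example function f are hypothetical.

```python
class Dual:
    """Hypothetical dual number: a value plus its derivative w.r.t. one chosen input."""

    def __init__(self, value, deriv=0.0):
        self.value = float(value)
        self.deriv = float(deriv)

    @staticmethod
    def _wrap(x):
        return x if isinstance(x, Dual) else Dual(x)

    def __add__(self, other):
        other = self._wrap(other)
        # Sum rule.
        return Dual(self.value + other.value, self.deriv + other.deriv)

    __radd__ = __add__

    def __mul__(self, other):
        other = self._wrap(other)
        # Product rule, applied during the forward pass.
        return Dual(self.value * other.value,
                    self.deriv * other.value + self.value * other.deriv)

    __rmul__ = __mul__


def f(x, y):
    # Two outputs computed from the same two inputs.
    return x * y + x, x * x * y


# One forward pass per input: seed that input's derivative with 1.0.
wrt_x = f(Dual(3.0, 1.0), Dual(2.0, 0.0))  # derivatives of both outputs w.r.t. x
wrt_y = f(Dual(3.0, 0.0), Dual(2.0, 1.0))  # derivatives of both outputs w.r.t. y
print([o.deriv for o in wrt_x], [o.deriv for o in wrt_y])
```

Each pass yields the derivative of every output with respect to a single input, whereas one reverse-mode pass yields the derivative of a single output with respect to every input.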

Not in Scope

This project is just for fun, so the following are not planned:

Vectorization
Higher-order derivatives (i.e. Scalar.grad being a Scalar itself)
Forward-mode automatic differentiation
Neural network library on top of MiniGrad

Owner

Kenny Song