Code for "Infinitely Deep Bayesian Neural Networks with Stochastic Differential Equations"

Last update: Nov 26, 2021

Overview

Infinitely Deep Bayesian Neural Networks with SDEs

This library contains JAX and Pytorch implementations of neural ODEs and Bayesian layers for stochastic variational inference. A rudimentary JAX implementation of differentiable SDE solvers is also provided, refer to torchsde [2] for a full set of differentiable SDE solvers in Pytorch and similarly to torchdiffeq [3] for differentiable ODE solvers.

Continuous-depth hidden unit trajectories in Neural ODE vs uncertain posterior dynamics SDE-BNN.

Installation

This library runs on jax==0.1.77 and torch==1.6.0. To install all other requirements:

pip install -r requirements.txt

Note: Package versions may change, refer to official JAX installation instructions here.

JaxSDE: Differentiable SDE Solvers in JAX

The jaxsde library contains SDE solvers in the Ito and Stratonovich form. Solvers of different orders can be specified with the following method={euler_maruyama|milstein|euler_heun} (strong orders 0.5|1|0.5 and orders 1|1|1 in the case of an additive noise SDE). Stochastic adjoint (sdeint_ito) training mode does not work efficiently yet, use sdeint_ito_fixed_grid for now. Tradeoff solver speed for precision during training or inference by adjusting --nsteps <# steps>.

Usage

Default solver: Backpropagation through the solver.

from jaxsde.jaxsde.sdeint import sdeint_ito_fixed_grid

y1 = sdeint_ito_fixed_grid(f, g, y0, ts, rng, fw_params, method="euler_maruyama")

Stochastic adjoint: Using O(1) memory instead of solving an adjoint SDE in the backward pass.

from jaxsde.jaxsde.sdeint import sdeint_ito

y1 = sdeint_ito(f, g, y0, ts, rng, fw_params, method="milstein")

Brax: Bayesian SDE Framework in JAX

Implementation of composable Bayesian layers in the stax API. Our SDE Bayesian layers can be used with the SDEBNN block composed with multiple parameterizations of time-dependent layers in diffeq_layers. Sticking-the-landing (STL) trick can be enabled during training with --stl for improving convergence rate. Augment the inputs by a custom amount --aug <integer>, set the number of samples averaged over with --nsamples <integer>. If memory constraints pose a problem, train in gradient accumulation mode: --acc_grad and gradient checkpointing: --remat.

Samples from SDEBNN-learned predictive prior and posterior density distributions.

Usage

All examples can be swapped in with different vision datasets. For better readability, tensorboard logging has been excluded (see torchbnn instead).

Toy 1D regression to learn complex posteriors:

python examples/jax/sdebnn_toy1d.py --ds cos --activn swish --loss laplace --kl_scale 1. --diff_const 0.2 --driftw_scale 0.1 --aug_dim 2 --stl --prior_dw ou

Image Classification:

To train an SDEBNN model:

python examples/jax/sdebnn_classification.py --output <output directory> --model sdenet --aug 2 --nblocks 2-2-2 --diff_coef 0.2 --fx_dim 64 --fw_dims 2-64-2 --nsteps 20 --nsamples 1

To train a ResNet baseline, specify --model resnet and for a Bayesian ResNet baseline, specify --meanfield_sdebnn.

TorchBNN: SDE-BNN in Pytorch

A PyTorch implementation of the Brax framework powered by the torchsde backend.

Usage

All examples can be swapped in with different vision datasets and includes tensorboard logging for critical metrics.

Toy 1D regression to learn multi-modal posterior:

python examples/torch/sdebnn_toy1d.py --output_dir <dst_path>

Arbitrarily expression approximate posteriors from learning non-Gaussian marginals.

Image Classification:

All hyperparameters can be found in the training script. Train with adjoint for memory efficient backpropagation and adaptive mode for adaptive computation (and ensure --adjoint_adaptive True if training with adjoint and adaptive modes).

python examples/torch/sdebnn_classification.py --train-dir <output directory> --data cifar10 --dt 0.05 --method midpoint --adjoint True --adaptive True --adjoint_adaptive True --inhomogeneous True

References

[1] Winnie Xu, Ricky T. Q. Chen, Xuechen Li, David Duvenaud. "Infinitely Deep Bayesian Neural Networks with Stochastic Differential Equations." Preprint 2021. [arxiv]

[2] Xuechen Li, Ting-Kam Leonard Wong, Ricky T. Q. Chen, David Duvenaud. "Scalable Gradients for Stochastic Differential Equations." AISTATS 2020. [arxiv]

[3] Ricky T. Q. Chen, Yulia Rubanova, Jesse Bettencourt, David Duvenaud. "Neural Ordinary Differential Equations." NeurIPS. 2018. [arxiv]

If you found this library useful in your research, please consider citing

@article{xu2021sdebnn,
  title={Infinitely Deep Bayesian Neural Networks with Stochastic Differential Equations},
  author={Xu, Winnie and Chen, Ricky T. Q. and Li, Xuechen and Duvenaud, David},
  archivePrefix = {arXiv},
  year={2021}
}

Code for "Infinitely Deep Bayesian Neural Networks with Stochastic Differential Equations"

Related tags

Overview

Infinitely Deep Bayesian Neural Networks with SDEs

Installation

JaxSDE: Differentiable SDE Solvers in JAX

Usage

Brax: Bayesian SDE Framework in JAX

Usage

Toy 1D regression to learn complex posteriors:

Image Classification:

TorchBNN: SDE-BNN in Pytorch

Usage

Toy 1D regression to learn multi-modal posterior:

Image Classification:

References

Owner

Winnie Xu

A Strong Baseline for Image Semantic Segmentation

Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

This is the repository for The Machine Learning Workshops, published by AI DOJO

Official Implementation for the "An Empirical Investigation of 3D Anomaly Detection and Segmentation" paper.

A python library for implementing a recommender system

A fast MoE impl for PyTorch

High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.

Joint Channel and Weight Pruning for Model Acceleration on Mobile Devices

TensorFlow implementation of "Attention is all you need (Transformer)"

TrackFormer: Multi-Object Tracking with Transformers

Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. ICCV 2021.

Udacity Suse Cloud Native Foundations Scholarship Course Walkthrough

This repository contains the reference implementation for our proposed Convolutional CRFs.

FinRL-Meta: A Universe for Data-Driven Financial Reinforcement Learning. 🔥

A Partition Filter Network for Joint Entity and Relation Extraction EMNLP 2021

A Pytorch Implementation of Domain adaptation of object detector using scissor-like networks

A tool for making map images from OpenTTD save games

Pathdreamer: A World Model for Indoor Navigation

Machine learning algorithms for many-body quantum systems

Blind Image Super-resolution with Elaborate Degradation Modeling on Noise and Kernel

Code for "Infinitely Deep Bayesian Neural Networks with Stochastic Differential Equations"

Related tags

Overview

Infinitely Deep Bayesian Neural Networks with SDEs

Installation

JaxSDE: Differentiable SDE Solvers in JAX

Usage

Brax: Bayesian SDE Framework in JAX

Usage

Toy 1D regression to learn complex posteriors:

Image Classification:

TorchBNN: SDE-BNN in Pytorch

Usage

Toy 1D regression to learn multi-modal posterior:

Image Classification:

References

Owner

Winnie Xu

A Strong Baseline for Image Semantic Segmentation

Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

This is the repository for The Machine Learning Workshops, published by AI DOJO

Official Implementation for the "An Empirical Investigation of 3D Anomaly Detection and Segmentation" paper.

A python library for implementing a recommender system

A fast MoE impl for PyTorch

High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.

Joint Channel and Weight Pruning for Model Acceleration on Mobile Devices

TensorFlow implementation of "Attention is all you need (Transformer)"

TrackFormer: Multi-Object Tracking with Transformers

Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. ICCV 2021.

Udacity Suse Cloud Native Foundations Scholarship Course Walkthrough

This repository contains the reference implementation for our proposed Convolutional CRFs.

FinRL­-Meta: A Universe for Data­-Driven Financial Reinforcement Learning. 🔥

A Partition Filter Network for Joint Entity and Relation Extraction EMNLP 2021

A Pytorch Implementation of Domain adaptation of object detector using scissor-like networks

A tool for making map images from OpenTTD save games

Pathdreamer: A World Model for Indoor Navigation

Machine learning algorithms for many-body quantum systems

Blind Image Super-resolution with Elaborate Degradation Modeling on Noise and Kernel

FinRL-Meta: A Universe for Data-Driven Financial Reinforcement Learning. 🔥