Anderson Accelerated Deep Learning (AADL)

AADL is a Python package that implements the Anderson acceleration to speed-up the training of deep learning (DL) models using the PyTorch library.
AA is an extrapolation technique that can accelerate fixed-point iterations such those arising from the iterative training of DL models. However, large volume of data are typically processed in sequential random batches which introduces stochastic oscillations in the fixed-point iteration that hinders AA acceleration. AADL implements a moving average that reduces the oscillations and results in a smoother sequence of gradient descent updates which enables the use of AA. AADL uses a criterion to automatically decide if the moving average is needed by monitoring if the relative standard deviation between consecutive stochastic gradient updates exceeds a tolerance defined by the user.

Requirements

Python 3.5 or greater
PyTorch (any version works)

Installation

AADL comes with a setuptools install script:

python3 setup.py install

Usage

import torch
import torch.nn
import torch.optim
import AADL

# Creation of the DL model (neural network)
class model(torch.nn.Module):
	...

# Definition of the stochastic optimizer used to train the model
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3, momentum=0.9, nesterov = True)

# Parameters for Anderson acceleration
relaxation = 0.5
wait_iterations = 0
history_depth = 10
store_each_nth = 10
frequency = store_each_nth
reg_acc = 0.0
safeguard = True
average = True

# Over-writing of the torch.optim.step() method 
AADL.accelerate(optimizer_anderson, "anderson", relaxation, wait_iterations, history_depth, store_each_nth, frequency, reg_acc, average)

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

BSD-3-Clause

Citations

"AADL: Anderson Accelerated Deep Learning", Copyright ID#: 81927550 https://doi.org/10.11578/dc.20210723.1

Anderson Acceleration for Deep Learning

Related tags

Overview

Anderson Accelerated Deep Learning (AADL)

Requirements

Installation

Usage

Contributing

License

Citations

Owner

Oak Ridge National Laboratory

A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body

✨风纪委员会自动投票脚本，利用Github Action帮你进行裁决操作（为了让其他风纪委员有案件可判，本程序从中午12点才开始运行，有需要请自己修改运行时间）

DGL-TreeSearch and the Gurobi-MWIS interface

The Wearables Development Toolkit - a development environment for activity recognition applications with sensor signals

Official PyTorch Implementation of "Self-supervised Auxiliary Learning with Meta-paths for Heterogeneous Graphs". NeurIPS 2020.

Supervised Contrastive Learning for Downstream Optimized Sequence Representations

Interactive web apps created using geemap and streamlit

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Python implementation of Bayesian optimization over permutation spaces.

Deep-Learning-Book-Chapter-Summaries - Attempting to make the Deep Learning Book easier to understand.

To prepare an image processing model to classify the type of disaster based on the image dataset

This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described in the paper.

MT3: Multi-Task Multitrack Music Transcription

Complete the code of prefix-tuning in low data setting

SafePicking: Learning Safe Object Extraction via Object-Level Mapping, ICRA 2022

This is the repository for The Machine Learning Workshops, published by AI DOJO

LexGLUE: A Benchmark Dataset for Legal Language Understanding in English

Source code for 2021 ICCV paper "In-the-Wild Single Camera 3D Reconstruction Through Moving Water Surfaces"

Implementation of Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning

Dashboard for the COVID19 spread