Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

Last update: Oct 18, 2021

Related tags

Overview

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

This repo contains official code for the NeurIPS 2021 paper Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations by Jiayao Zhang, Hua Wang, Weijie J. Su.

Discussions welcome, please submit via Discussions. You can also read the reviews on OpenReview.

@misc{zhang2021imitating,
      title={Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations}, 
      author={Jiayao Zhang and Hua Wang and Weijie J. Su},
      year={2021},
      eprint={2110.05960},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Reproducing Experiments

Dependencies

We use Python 3.8 and pytorch for training neural nets, please use pip install -r requirements.txt (potentially in a virtual environment) to install dependencies.

Datasets

We use a dataset of geometric shapes (GeoMNIST) we constructed as well as CIFAR-10. GeoMNIST is lightweighted and will be generated when simulation runs; CIFAR-10 will be downloaded from torchvision.

Code Structure

After instsalling the dependencies, one may navigate through the two Jupyter notebooks for running experiments and producing plots and figures. Below we outline the code structure.

.
├── LICENSE                         # code license
├── README.md                       # this file
├── LE-SDE Data Analysis.ipynb      # reproducing plots and figures
├── LE-SDE Experiments.ipynb        # reproducing experiments
└── src                         # source code
    ├── data_analyzer.py            # processing experiment data
    ├── datasets.py                 # generating and loading datasets
    ├── models.py                   # definition of neural net models
    ├── plotter.py                  # generating plots and figures
    └── utils.py                    # utilities, including training pipelines
└── exp_data                    # experiment data
    ├── *.csv                       # dataframes from neural net training
    └── *.npy                       # numpy.ndarray storing LE-ODE simulations

More info regarding npy files can be found in the numpy documentation.

Reproducing Figures

Experiment Data

Although all simulations can be run on your machine, it is quite time-consuming. Data from our experiments can be downloaded from the following anonymous Dropbox links:

lesde_exp_data.tar.gz (1.02GB): *.csv files for reproducing Figures 1-4.
lesde_sim_data.tar.gz (2.54GB): *.npy files for reproducing Figure 5.

After downloading those tarballs, extract them into ./exp_data (or change the EXP_DIR variable in the notebooks accordingly).

Plotter

Once experiment data are ready, simply follow LE-SDE Data Analysis.ipynb for reproducing all figures.

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

Related tags

Overview

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

Reproducing Experiments

Dependencies

Datasets

Code Structure

Reproducing Figures

Experiment Data

Plotter

Owner

Jiayao Zhang

Tutorials, assignments, and competitions for MIT Deep Learning related courses.

A PyTorch implementation of a Factorization Machine module in cython.

Implementation of the pix2pix model on satellite images

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

Official code for "Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer. ICCV2021".

Fully Convolutional DenseNet (A.K.A 100 layer tiramisu) for semantic segmentation of images implemented in TensorFlow.

PyTorch reimplementation of the paper Involution: Inverting the Inherence of Convolution for Visual Recognition [CVPR 2021].

WarpRNNT loss ported in Numba CPU/CUDA for Pytorch

Official Code Implementation of the paper : XAI for Transformers: Better Explanations through Conservative Propagation

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

Libtorch yolov3 deepsort

Winners of DrivenData's Overhead Geopose Challenge

[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators

Event-forecasting - Event Forecasting Algorithms With Python

[NeurIPS 2021] Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data

Code for our NeurIPS 2021 paper 'Exploiting the Intrinsic Neighborhood Structure for Source-free Domain Adaptation'

Self-Supervised Contrastive Learning of Music Spectrograms

Learning Continuous Signed Distance Functions for Shape Representation

Generative Adversarial Networks(GANs)

Model-free Vehicle Tracking and State Estimation in Point Cloud Sequences