This repo is the official implementation of "L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization".

Last update: Jul 14, 2022

Related tags

Deep Learning L2ight

Overview

L2ight

By Jiaqi Gu, Hanqing Zhu, Chenghao Feng, Zixuan Jiang, Ray T. Chen and David Z. Pan.

This repo is the official implementation of "L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization".

Introduction

L2ight is a closed-loop ONN on-chip learning framework to enable scalable ONN mapping and efficient in-situ learning. L2ight adopts a three-stage learning flow that first calibrates the complicated photonic circuit states under challenging physical constraints, then performs photonic core mapping via combined analytical solving and zeroth-order optimization. A subspace learning procedure with multi-level sparsity is integrated into L2ight to enable in-situ gradient evaluation and fast adaptation, unleashing the power of optics for real on-chip intelligence. L2ight outperforms prior ONN training protocols with 3-order-of-magnitude higher scalability and over 30X better efficiency, when benchmarked on various models and learning tasks. This synergistic framework is the first scalable on-chip learning solution that pushes this emerging field from intractable to scalable and further to efficient for next-generation self-learnable photonic neural chips.

Dependencies

Python >= 3.6
pyutils >= 0.0.1. See pyutils for installation.
pytorch-onn >= 0.0.1. See pytorch-onn for installation.
Python libraries listed in requirements.txt
NVIDIA GPUs and CUDA >= 10.2

Structures

core/
- models/
  - layers/
    - custom_conv2d and custom_linear layers
    - utils.py: sampler and profiler
  - sparse_bp_*.py: model definition
  - sparse_bp_base.py: base model definition; identity calibration and mapping codes.
- optimizer/: mixedtrain and flops optimizers
- builder.py: build training utilities
script/: contains experiment scripts
train_pretrain.py, train_map.py, train_learn.py, train_zo_learn.py: training logic
compare_gradient.py: compare approximated gradients with true gradients for ablation

Usage

Pretrain model.
> python3 train_pretrain.py config/cifar10/vgg8/pretrain.yml
Identity calibration and parallel mapping. Please set your hyperparameters in CONFIG=config/cifar10/vgg8/pm/pm.yml and run
> python3 train_map.py CONFIG --checkpoint.restore_checkpoint=path/to/your/pretrained/checkpoint
Subspace learning with multi-level sampling. Please set your hyperparameters in CONFIG=config/cifar10/vgg8/ds/learn.yml and run
> python3 train_learn.py CONFIG --checkpoint.restore_chekcpoint=path/to/your/mapped/checkpoint --checkpoint.resume=1
All scripts for experiments are in ./script. For example, to run subspace learning with feedback sampling, column sampling, and data sampling, you can write proper task setting in SCRIPT=script/vgg8/train_ds_script.py and run
> python3 SCRIPT
Comparison experiments with RAD [ICLR 2021] and SWAT-U [NeurIPS 2020]. Run with the SCRIPT=script/vgg8/train_rad_script.py and script/vgg8/train_swat_script.py,
> python3 SCRIPT
Comparison with FLOPS [DAC 2020] and MixedTrn [AAAI 2021]. Run with the METHOD=mixedtrain or flops,
> python3 train_zo_learn.py config/mnist/cnn3/METHOD/learn.yml

Citing L2ight

@inproceedings{gu2021L2ight,
  title={L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization},
  author={Jiaqi Gu and Hanqing Zhu and Chenghao Feng and Zixuan Jiang and Ray T. Chen and David Z. Pan},
  journal={Conference on Neural Information Processing Systems (NeurIPS)},
  year={2021}
}

This repo is the official implementation of "L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization".

Related tags

Overview

L2ight

Introduction

Dependencies

Structures

Usage

Citing L2ight

Owner

Jiaqi Gu

METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)

The pytorch implementation of the paper "text-guided neural image inpainting" at MM'2020

The code for our paper submitted to RAL/IROS 2022: OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition.

An open-source online reverse dictionary.

Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes (CVPR 2021 Oral)

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

Hydra Lightning Template for Structured Configs

Implementation of Nyström Self-attention, from the paper Nyströmformer

The Most Efficient Temporal Difference Learning Framework for 2048

A package for music online and offline rhythmic information analysis including music Beat, downbeat, tempo and meter tracking.

This package proposes simplified exporting pytorch models to ONNX and TensorRT, and also gives some base interface for model inference.

Official repository of my book: "Deep Learning with PyTorch Step-by-Step: A Beginner's Guide"

Easy-to-use,Modular and Extendible package of deep-learning based CTR models .

Annealed Flow Transport Monte Carlo

Multi-Agent Reinforcement Learning for Active Voltage Control on Power Distribution Networks (MAPDN)

Experimenting with computer vision techniques to generate annotated image datasets from gameplay recordings automatically.

Easy to use Python camera interface for NVIDIA Jetson

A library of extension and helper modules for Python's data analysis and machine learning libraries.

A multi-functional library for full-stack Deep Learning. Simplifies Model Building, API development, and Model Deployment.

[ICCV21] Official implementation of the "Social NCE: Contrastive Learning of Socially-aware Motion Representations" in PyTorch.