Fast Scattering Transform with CuPy/PyTorch

Last update: Dec 07, 2022

Overview

Announcement

11/18

This package is no longer supported. We have released kymatio (http://www.kymat.io/, https://github.com/kymatio/kymatio), which includes fast, optimized, differentiable 1D, 2D and 3D Scattering Transforms and subsumes all the behavior of pyscatwave. Among other things, it makes differentiable 2D scattering easier to use and can run on the CPU if desired. kymatio will be well supported, with a substantially larger development team than pyscatwave.

07/18

We just released a differentiable 2D Scattering example in the master branch. It is neither memory efficient nor fast yet.

PyScatWave

CuPy/PyTorch Scattering implementation

A scattering network is a convolutional network whose filters are predefined wavelets rather than learned weights. It can be used in vision tasks such as image classification. The scattering transform can drastically reduce the spatial resolution of the input (e.g. 224x224 -> 14x14) with demonstrably negligible loss in discriminative power.
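As a rough illustration of the resolution reduction described above, the sketch below computes the size of a second-order 2D scattering output. It rests on assumptions not stated in this README: the output is subsampled by 2^J, the transform uses L=8 orientations, and coefficients are kept up to order 2. The helper name scattering_output_shape is hypothetical and is not part of the scatwave API.

```python
def scattering_output_shape(height, width, J, L=8):
    """Return (channels per input channel, out_height, out_width).

    Assumes subsampling by 2**J, L orientations, and orders 0 to 2.
    """
    # Order 0: a single low-pass channel.
    order0 = 1
    # Order 1: one channel per (scale, orientation) pair.
    order1 = L * J
    # Order 2: one channel per pair of scales j1 < j2 and pair of orientations.
    order2 = L * L * J * (J - 1) // 2
    factor = 2 ** J
    return order0 + order1 + order2, height // factor, width // factor


print(scattering_output_shape(224, 224, J=4))  # (417, 14, 14)
print(scattering_output_shape(32, 32, J=2))    # (81, 8, 8)
```

Under these assumptions, J=4 corresponds to the 224x224 -> 14x14 reduction mentioned above, and the loss of spatial resolution is offset by a larger number of channels.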

The software uses PyTorch + NumPy FFT on CPU, and PyTorch + CuPy + CuFFT on GPU.

Previous (Lua-based) versions of the code can be found at https://github.com/edouardoyallon/scatwave

If using this code for your research please cite our paper:

E. Oyallon, E. Belilovsky, S. Zagoruyko. "Scaling the Scattering Transform: Deep Hybrid Networks".

You can find experiments from the paper in the following repository: https://github.com/edouardoyallon/scalingscattering/

We used PyTorch to run the experiments in https://arxiv.org/abs/1703.08961, but scattering can be used with other frameworks (e.g. Chainer, Theano or TensorFlow) if one copies the Scattering outputs to the CPU (or runs the transform on the CPU) and converts them to numpy.ndarray via .numpy().

Benchmarks

We ran some simple timings against the previous multi-core CPU implementation of scattering (ScatNetLight), using a 1080 GPU. Below are the input sizes (W x H x 3 x BatchSize) with the measured time and the speedup:

32 × 32 × 3 × 128 (J=2) - 0.03 s (speedup of 8x vs ScatNetLight)

256 × 256 × 3 × 128 (J=2) - 0.71 s (speedup of 225x vs ScatNetLight)
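To put these figures in perspective, the short sketch below derives throughput and the ScatNetLight time implied by each speedup. It assumes each reported time covers one full batch of 128 images, which the README does not state explicitly. The helper name summarize is hypothetical, and the implied CPU times are computed from the listed speedups, not measured.

```python
# (label, batch size, GPU seconds, reported speedup vs ScatNetLight)
benchmarks = [
    ("32x32x3x128", 128, 0.03, 8),
    ("256x256x3x128", 128, 0.71, 225),
]


def summarize(batch_size, gpu_seconds, speedup):
    """Return (images per second on GPU, implied ScatNetLight seconds)."""
    # Assumption: gpu_seconds is the time to process one whole batch.
    images_per_second = batch_size / gpu_seconds
    # Implied baseline time, derived from the reported speedup only.
    implied_cpu_seconds = gpu_seconds * speedup
    return images_per_second, implied_cpu_seconds


for label, batch_size, gpu_seconds, speedup in benchmarks:
    ips, cpu_s = summarize(batch_size, gpu_seconds, speedup)
    print(f"{label}: {ips:.0f} images/s on GPU, ~{cpu_s:.2f} s implied for ScatNetLight")
```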

Installation

The software was tested on Linux with Anaconda Python 2.7 and various GPUs, including Titan X, 1080s, 980s, K20s, and Titan X Pascal.

First install PyTorch following the instructions at http://pytorch.org, then install the remaining requirements and the package itself:

pip install -r requirements.txt
python setup.py install

Usage

Example:

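import torch
from scatwave.scattering import Scattering

# Scattering transform for 32x32 inputs with J=2, moved to the GPU
scat = Scattering(M=32, N=32, J=2).cuda()
# Random batch: 1 image, 3 channels, 32x32 pixels
x = torch.randn(1, 3, 32, 32).cuda()

# Print the size of the scattering output (call form works in Python 2 and 3)
print(scat(x).size())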

Contribution

All contributions are welcome.

Authors

Edouard Oyallon, Eugene Belilovsky, Sergey Zagoruyko
