Learning to See by Looking at Noise

Last update: Dec 24, 2022

Related tags

Overview

Learning to See by Looking at Noise

This is the official implementation of Learning to See by Looking at Noise.

In this work, we investigate a suite of image generation models that produce images from simple random processes. These are then used as training data for a visual representation learner with a contrastive loss. We study two types of noise processes, statistical image models and deep generative models under different random initializations.

[Project page] [Paper] [arXiv]

Requirements

This version of code has been tested with Python 3.7.7 and pytorch 1.6.0. Other versions of pytorch are likely to work out of the box. The contrastive training requires two GPU's with at least 12GB of memory for the small scale experiments, while the large scale experiments require the same computation resources as the facebookresearch implementation of MoCo.

To use this repo, first clone it and correct the permissions for the scripts with:

git clone https://github.com/mbaradad/learning_with_noise
cd learning_with_noise
chmod 755 -R scripts

To install all the requirements, simply do:

pip intall -r requirements.txt

Small scale experiments

To download all the datasets, first run:

./scripts/download_datasets/download_small_scale_datasets.sh

Then you can launch the contrastive training for all the small scale experiments with:

./scripts/train_align_uniform/main.sh <GPU_ID_0> <GPU_ID_1>

If you just want to test the linear evaluation of the models (or do something else with them), you can directly download our pretrained encoders with:

./scripts/download_pretrained_models/download_all_alexnet_encoders.sh

Finally, you can evaluate the linear performance with imagenet100 as:

./scripts/train_align_uniform/linear_eval.sh <path-to-imagenet100> <GPU_ID>

Where is the path to the imagenet100 dataset dir, which should contain two dirs (train and val) each with the train and val samples respectively for the 100 imagenet100 classes. If you have imagenet1k, you can generate imagenet100 using the following command (which will create simlyncs to your imagenet1k dir):

./scripts/datasets/generate_imagenet100.sh <path-to-imagenet1k> <path-to-imagenet100>

Large scale experiments

Datasets and encoders will be be released soon!

Data generation

Scripts to generate the datasets will be released soon!

Learning to See by Looking at Noise

Related tags

Overview

Learning to See by Looking at Noise

Requirements

Small scale experiments

Large scale experiments

Data generation

Owner

Manel Baradad Jurjo

A unified 3D Transformer Pipeline for visual synthesis

This repository contains the code for "SBEVNet: End-to-End Deep Stereo Layout Estimation" paper by Divam Gupta, Wei Pu, Trenton Tabor, Jeff Schneider

Code for the paper "Adversarially Regularized Autoencoders (ICML 2018)" by Zhao, Kim, Zhang, Rush and LeCun

The official MegEngine implementation of the ICCV 2021 paper: GyroFlow: Gyroscope-Guided Unsupervised Optical Flow Learning

Repo for parser tensorflow(.pb) and tflite(.tflite)

A Python package to create, run, and post-process MODFLOW-based models.

Automate issue discovery for your projects against Lightning nightly and releases.

Context-Sensitive Misspelling Correction of Clinical Text via Conditional Independence, CHIL 2022

This repository contains the accompanying code for Deep Virtual Markers for Articulated 3D Shapes, ICCV'21

Real-Time Seizure Detection using EEG: A Comprehensive Comparison of Recent Approaches under a Realistic Setting

PyTorch Implementation of the paper Learning to Reweight Examples for Robust Deep Learning

Cl datasets - PyTorch image dataloaders and utility functions to load datasets for supervised continual learning

Python implementation of NARS (Non-Axiomatic-Reasoning-System)

PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models

DNA sequence classification by Deep Neural Network

R3Det based on mmdet 2.19.0

PyTorch version repo for CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes

A repo to show how to use custom dataset to train s2anet, and change backbone to resnext101

The tl;dr on a few notable transformer/language model papers + other papers (alignment, memorization, etc).

Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"