Multi-Glimpse Network With Python

Last update: May 10, 2022

Related tags

Deep Learning MGNet

Overview

Multi-Glimpse Network

Our code requires Python ≥ 3.8

Installation

For example, venv + pip:

$ python3 -m venv env
$ source env/bin/activate
(env) $ python3 -m pip install -r requirements.txt

Evaluation

Accuracy on clean images

Create ImageNet100 from ImageNet (using symbolic links).

$ python3 tools/create_imagenet100.py tools/imagenet100.txt \
    /path/to/ImageNet /path/to/ImageNet100

Download checkpoints from Google Drive.
Test accuracy.

$ export dataset="--train_dir /path/to/ImageNet100/train \
    --val_dir /path/to/ImageNet100/val \
    --dataset imagenet --num_class 100"
# Baseline
$ python3 main.py $dataset --test --n_iter 1 --scale 1.0  --model resnet18 \
    --checkpoint resnet18_baseline
# Ours
$ python3 main.py $dataset --test --n_iter 4 --scale 2.33 --model resnet18 \
    --checkpoint resnet18_ours --alpha 0.6 --s 0.02

Add the flag --flop_count to count the approximate FLOPs for the inference of an image. (using fvcore)

Accuracy on adversarial attacks (PGD)

Test adversarial accuracy.

# Baseline
$ python3 main.py $dataset --test --n_iter 1 --scale 1.0  --adv --step_k 10 \
    --model resnet18 --checkpoint resnet18_baseline
# Ours
$ python3 main.py $dataset --test --n_iter 4 --scale 2.33 --adv --step_k 10 \
    --model resnet18 --checkpoint resnet18_ours --alpha 0.6 --s 0.02

Accuracy on common corruptions

Create ImageNet100-C from ImageNet-C (using symbolic links).

$ python3 tools/create_imagenet100c.py  \
    tools/imagenet100.txt  /path/to/ImageNet-C/ /path/to/ImageNet100-C/

Test for a single corruption.

$ export dataset="--train_dir /path/to/ImageNet100/train \
    --val_dir /path/to/ImageNet100-C/pixelate/5 \
    --dataset imagenet --num_class 100"
# Baseline
$ python3 main.py $dataset --test --n_iter 1 --scale 1.0  --model resnet18 \
    --checkpoint resnet18_baseline
# Ours
$ python3 main.py $dataset --test --n_iter 4 --scale 2.33 --model resnet18 \
    --checkpoint resnet18_ours --alpha 0.6 --s 0.02

A simple script to test all corruptions and collect results.

# Modify tools/eval_imagenet100c.py and run it to generate script
$ python3 tools/eval_imagenet100c.py /home2/ImageNet100-C/ > run.sh
# Evaluate
$ bash run.sh
# Collect results
$ python3 tools/collect_imagenet100c.py

Training

$ export dataset="--train_dir /path/to/ImageNet100/train \
    --val_dir /path/to/ImageNet100/val \
    --dataset imagenet --num_class 100"
# Baseline
$ python3 main.py $dataset --epochs 400 --n_iter 1 --scale 1.0 \
    --model resnet18 --gpu 0,1,2,3
# Ours
$ python3 main.py $dataset --epochs 400 --n_iter 4 --scale 2.33 \
    --model resnet18 --alpha 0.6 --s 0.02  --gpu 0,1,2,3

Check tensorboard for the logs. (When training with multiple gpus, the log value may be scaled by the number of gpus except for the validation accuracy)

tensorboard  --logdir=logs

Note that we left our exploration in the code for further study, e.g., self-supervised spatial guidance, dynamic gradient re-scaling operation.

Multi-Glimpse Network With Python

Related tags

Overview

Multi-Glimpse Network

Installation

Evaluation

Accuracy on clean images

Accuracy on adversarial attacks (PGD)

Accuracy on common corruptions

Training

Owner

DeOldify - A Deep Learning based project for colorizing and restoring old images (and video!)

Analyzes your GitHub Profile and presents you with a report on how likely you are to become the next MLH Fellow!

The Wearables Development Toolkit - a development environment for activity recognition applications with sensor signals

Synthetic LiDAR sequential point cloud dataset with point-wise annotations

CFNet: Cascade and Fused Cost Volume for Robust Stereo Matching（CVPR2021）

Example of a Quantum LSTM

A collection of Google research projects related to Federated Learning and Federated Analytics.

Transfer Learning Remote Sensing

Aerial Imagery dataset for fire detection: classification and segmentation (Unmanned Aerial Vehicle (UAV))

Athena is the only tool that you will ever need to optimize your portfolio.

Code and dataset for AAAI 2021 paper FixMyPose: Pose Correctional Describing and Retrieval Hyounghun Kim, Abhay Zala, Graham Burri, Mohit Bansal.

HistoKT: Cross Knowledge Transfer in Computational Pathology

Projects of Andfun Yangon

Data augmentation for NLP, accepted at EMNLP 2021 Findings

[ICML 2020] DrRepair: Learning to Repair Programs from Error Messages

PSTR: End-to-End One-Step Person Search With Transformers (CVPR2022)

This project provides the proof of the uniqueness of the equilibrium and the global asymptotic stability.

Time Series Forecasting with Temporal Fusion Transformer in Pytorch

Improving Object Detection by Label Assignment Distillation

Official code for "End-to-End Optimization of Scene Layout" -- including VAE, Diff Render, SPADE for colorization (CVPR 2020 Oral)