(ICCV'21) Official PyTorch implementation of Relational Embedding for Few-Shot Classification

Last update: Dec 24, 2022

Overview

Relational Embedding for Few-Shot Classification (ICCV 2021)

Dahyun Kang, Heeseung Kwon, Juhong Min, Minsu Cho

[paper], [project homepage]

We propose to address the problem of few-shot classification by meta-learning “what to observe” and “where to attend” from a relational perspective. Our method leverages relational patterns within and between images via self-correlational representation (SCR) and cross-correlational attention (CCA). Within each image, the SCR module transforms a base feature map into a self-correlation tensor and learns to extract structural patterns from the tensor. Between the images, the CCA module computes cross-correlation between two image representations and learns to produce co-attention between them. Panels (a), (b), and (c) of the accompanying figure visualize the activation maps of base features, self-correlational representation, and cross-correlational attention, respectively. Our Relational Embedding Network (RENet) combines the two relational modules to learn relational embedding in an end-to-end manner. In experimental evaluation, it achieves consistent improvements over state-of-the-art methods on four widely used few-shot classification benchmarks: miniImageNet, tieredImageNet, CUB-200-2011, and CIFAR-FS.
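
The two correlation tensors described above can be illustrated with a minimal PyTorch sketch. It is not the code in this repository: the function names `self_correlation` and `cross_correlation`, the window size, and the use of channel-normalized products (cosine similarity) as the correlation measure are assumptions made for illustration, and the learned layers that follow each tensor in SCR and CCA are omitted.

```python
import torch
import torch.nn.functional as F


def self_correlation(feat, window=5):
    """Correlate each position with its (window x window) neighborhood.

    feat: (B, C, H, W) base feature map; window is assumed to be odd.
    Returns a (B, C, window, window, H, W) tensor of channel-wise products
    between the L2-normalized feature at a position and at each neighbor.
    """
    b, c, h, w = feat.shape
    feat = F.normalize(feat, dim=1)  # unit length along the channel axis
    # unfold gathers every neighborhood: (B, C * window * window, H * W)
    neighbors = F.unfold(feat, kernel_size=window, padding=window // 2)
    neighbors = neighbors.reshape(b, c, window, window, h, w)
    center = feat.reshape(b, c, 1, 1, h, w)
    return center * neighbors


def cross_correlation(query, support):
    """Cosine similarity between every query and every support position.

    query, support: (B, C, H, W). Returns a (B, H, W, H, W) tensor.
    """
    b, c, h, w = query.shape
    q = F.normalize(query.flatten(2), dim=1)    # (B, C, H*W)
    s = F.normalize(support.flatten(2), dim=1)  # (B, C, H*W)
    corr = torch.einsum("bci,bcj->bij", q, s)   # (B, H*W, H*W)
    return corr.reshape(b, h, w, h, w)


if __name__ == "__main__":
    torch.manual_seed(0)
    x = torch.randn(2, 8, 5, 5)
    print(self_correlation(x, window=3).shape)
    print(cross_correlation(x, x).shape)
```

In this sketch the self-correlation tensor keeps the channel dimension, so a later learned layer could still read per-channel structure, while the cross-correlation tensor collapses channels into a single similarity per pair of positions.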

✔️ Requirements

Ubuntu 16.04
Python 3.7
CUDA 11.0
PyTorch 1.7.1

⚙️ Conda environment installation

conda env create --name renet_iccv21 --file environment.yml
conda activate renet_iccv21
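
After activating the environment, a short check such as the following can confirm which versions are actually installed. This is only a sketch: the helper name `environment_report` is hypothetical and not part of this repository, and the values to compare against are the ones listed under Requirements (Python 3.7, CUDA 11.0, PyTorch 1.7.1).

```python
import importlib.util
import platform


def environment_report():
    """Collect the Python, PyTorch, and CUDA versions visible to this interpreter."""
    report = {
        "python": platform.python_version(),
        "torch": None,
        "cuda": None,
        "gpu_available": False,
    }
    # Import torch only if it is installed, so the check never crashes.
    if importlib.util.find_spec("torch") is not None:
        import torch

        report["torch"] = torch.__version__
        report["cuda"] = torch.version.cuda  # None for CPU-only builds
        report["gpu_available"] = torch.cuda.is_available()
    return report


if __name__ == "__main__":
    for name, value in environment_report().items():
        print(f"{name}: {value}")
```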

📚 Datasets

cd datasets
bash download_miniimagenet.sh
bash download_cub.sh
bash download_cifar_fs.sh
bash download_tieredimagenet.sh

🌳 Authors' checkpoints

cd checkpoints
bash download_checkpoints_renet.sh

The file structure should be as follows:

renet/
├── datasets/
├── model/
├── scripts/
├── checkpoints/
│   ├── cifar_fs/
│   ├── cub/
│   ├── miniimagenet/
│   └── tieredimagenet/
├── train.py
├── test.py
├── README.md
└── environment.yml

📌 Quick start: testing scripts

To test in the 5-way K-shot setting:

bash scripts/test/{dataset_name}_5wKs.sh

For example, to test RENet on the miniImageNet dataset in the 5-way 1-shot setting:

bash scripts/test/miniimagenet_5w1s.sh
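
For readers new to the terminology, an N-way K-shot episode draws N classes, K labeled support examples per class, and a set of query examples from the same classes to classify. The sketch below shows one way to sample such an episode in plain Python. It is not the sampler used by this repository: the function name `sample_episode` and the default of 15 query examples per class are assumptions.

```python
import random
from collections import defaultdict


def sample_episode(labels, n_way=5, k_shot=1, n_query=15, rng=None):
    """Return (support, query) lists of dataset indices for one episode.

    labels: sequence where labels[i] is the class of example i.
    """
    rng = rng or random.Random()
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)

    # Only classes with enough examples can fill both support and query sets.
    eligible = [c for c, idxs in by_class.items() if len(idxs) >= k_shot + n_query]
    if len(eligible) < n_way:
        raise ValueError("not enough classes with k_shot + n_query examples")

    support, query = [], []
    for c in rng.sample(eligible, n_way):
        picked = rng.sample(by_class[c], k_shot + n_query)  # no repeats
        support.extend(picked[:k_shot])
        query.extend(picked[k_shot:])
    return support, query


if __name__ == "__main__":
    toy_labels = [i // 20 for i in range(200)]  # 10 classes, 20 examples each
    s, q = sample_episode(toy_labels, n_way=5, k_shot=1, rng=random.Random(0))
    print(len(s), len(q))
```

In the 5-way 1-shot setting this yields 5 support indices and, under the assumed default, 75 query indices, with no index shared between the two sets.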

🔥 Training scripts

To train in the 5-way K-shot setting:

bash scripts/train/{dataset_name}_5wKs.sh

For example, to train RENet on the CUB dataset in the 5-way 1-shot setting:

bash scripts/train/cub_5w1s.sh

Training & testing a 5-way 1-shot model on the CUB dataset using a TitanRTX 3090 GPU takes 41m 30s.

🎨 Few-shot classification results

Experimental results on few-shot classification datasets with a ResNet-12 backbone. We report average results with 2,000 randomly sampled episodes.

dataset	5-way 1-shot accuracy (%)	5-way 5-shot accuracy (%)
miniImageNet	67.60	82.58
tieredImageNet	71.61	85.28
CUB-200-2011	79.49	91.11
CIFAR-FS	74.51	86.60
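
Averaging over sampled episodes, as in the results above, can be sketched as follows. The helper `summarize_accuracy` is hypothetical. The 95% confidence interval it returns is an addition that is common in few-shot evaluation but is not reported in the table above, and it assumes a normal approximation over episodes.

```python
import math
import statistics


def summarize_accuracy(episode_accuracies):
    """Return (mean, half-width of a 95% confidence interval).

    episode_accuracies: per-episode accuracies in percent, one per episode.
    """
    n = len(episode_accuracies)
    if n < 2:
        raise ValueError("need at least two episodes")
    mean = statistics.mean(episode_accuracies)
    std = statistics.stdev(episode_accuracies)  # sample standard deviation
    ci95 = 1.96 * std / math.sqrt(n)            # normal approximation
    return mean, ci95


if __name__ == "__main__":
    # Toy numbers, not results from the paper.
    toy = [60.0, 80.0] * 1000  # 2,000 episodes
    mean, ci95 = summarize_accuracy(toy)
    print(f"{mean:.2f} +/- {ci95:.2f}")
```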

🔍 Related repos

Our project references code from the following repos:

Zhang et al., DeepEMD
Ye et al., FEAT
Wang et al., Non-local neural networks
Ramachandran et al., Stand-alone self-attention
Huang et al., DCCNet
Yang et al., VCN

💌 Acknowledgement

We adopted the main code base from DeepEMD, and we really appreciate it 😃. We also sincerely thank all the ICCV reviewers, especially R#2, for their valuable suggestions.

📜 Citing RENet

If you find our code or paper useful for your research, please consider citing our work using the following BibTeX entry:

@inproceedings{kang2021renet,
    author   = {Kang, Dahyun and Kwon, Heeseung and Min, Juhong and Cho, Minsu},
    title    = {Relational Embedding for Few-Shot Classification},
    booktitle= {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    year     = {2021}
}
