Official Implementation of VAT

Last update: Dec 27, 2022

Overview

Semantic correspondence

Few-shot segmentation

Cost Aggregation Is All You Need for Few-Shot Segmentation

For more information, check out project [Project Page] and the paper on [arXiv].

Network

Our model VAT is illustrated below:

Environment Settings

git clone https://github.com/Seokju-Cho/Volumetric-Aggregation-Transformer.git

cd Volumetric-Aggregation-Transformer

conda env create -f environment.yaml

Preparing Few-Shot Segmentation Datasets

Download following datasets:

1. PASCAL-5ⁱ

Download PASCAL VOC2012 devkit (train/val data):
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2012/VOCtrainval_11-May-2012.tar
Download PASCAL VOC2012 SDS extended mask annotations from our [Google Drive].

2. COCO-20ⁱ

Download COCO2014 train/val images and annotations:
wget http://images.cocodataset.org/zips/train2014.zip
wget http://images.cocodataset.org/zips/val2014.zip
wget http://images.cocodataset.org/annotations/annotations_trainval2014.zip
Download COCO2014 train/val annotations from our Google Drive: [train2014.zip], [val2014.zip]. (and locate both train2014/ and val2014/ under annotations/ directory).

3. FSS-1000

Download FSS-1000 images and annotations from our [Google Drive].

Create a directory '../Datasets_VAT' for the above three few-shot segmentation datasets and appropriately place each dataset to have following directory structure:

../                         # parent directory
└── Datasets_VAT/
    ├── VOC2012/            # PASCAL VOC2012 devkit
    │   ├── Annotations/
    │   ├── ImageSets/
    │   ├── ...
    │   └── SegmentationClassAug/
    ├── COCO2014/           
    │   ├── annotations/
    │   │   ├── train2014/  # (dir.) training masks (from Google Drive) 
    │   │   ├── val2014/    # (dir.) validation masks (from Google Drive)
    │   │   └── ..some json files..
    │   ├── train2014/
    │   └── val2014/
    └── FSS-1000/           # (dir.) contains 1000 object classes
        ├── abacus/   
        ├── ...
        └── zucchini/

Training

Training on PASCAL-5ⁱ:

  python train.py --config "config/pascal_resnet{50, 101}/pascal_resnet{50, 101}_fold{0, 1, 2, 3}/config.yaml"

Training on COCO-20ⁱ:

  python train.py --config "config/coco_resnet50/coco_resnet50_fold{0, 1, 2, 3}/config.yaml"

Training on FSS-1000:

  python train.py --config "config/fss_resnet{50, 101}/config.yaml"

Evaluation

Download pre-trained weights on Link

Result on PASCAL-5ⁱ:

  python test.py --load "/path_to_pretrained_model/pascal_resnet{50, 101}/pascal_resnet{50, 101}_fold{0, 1, 2, 3}/"

Result on COCO-20ⁱ:

  python test.py --load "/path_to_pretrained_model/coco_resnet50/coco_resnet50_fold{0, 1, 2, 3}/"

Results on FSS-1000:

  python test.py --load "/path_to_pretrained_model/fss_resnet{50, 101}/"

Acknowledgement

We borrow code from public projects (huge thanks to all the projects). We mainly borrow code from HSNet.

Official Implementation of VAT

Related tags

Overview

Semantic correspondence

Few-shot segmentation

Cost Aggregation Is All You Need for Few-Shot Segmentation

Network

Environment Settings

Preparing Few-Shot Segmentation Datasets

1. PASCAL-5ⁱ

2. COCO-20ⁱ

3. FSS-1000

Training

Evaluation

Acknowledgement

Owner

Hamacojr

CSPML (crystal structure prediction with machine learning-based element substitution)

Multi-Scale Progressive Fusion Network for Single Image Deraining

[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

Imaginaire - NVIDIA's Deep Imagination Team's PyTorch Library

Pi-NAS: Improving Neural Architecture Search by Reducing Supernet Training Consistency Shift (ICCV 2021)

tf2-keras implement yolov5

Voxel Transformer for 3D object detection

"Segmenter: Transformer for Semantic Segmentation" reproduced via mmsegmentation

InsightFace: 2D and 3D Face Analysis Project on MXNet and PyTorch

An open-source online reverse dictionary.

Accompanying code for the paper "A Kernel Test for Causal Association via Noise Contrastive Backdoor Adjustment".

Flask101 - FullStack Web Development with Python & JS - From TAQWA

An implementation of a sequence to sequence neural network using an encoder-decoder

Pytorch domain adaptation package

Digitalizing-Prescription-Image - PIRDS - Prescription Image Recognition and Digitalizing System is a OCR make with Tensorflow

Official implementation for “Unsupervised Low-Light Image Enhancement via Histogram Equalization Prior”

IhoneyBakFileScan Modify - 批量网站备份文件扫描器，增加文件规则，优化内存占用

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

TensorFlow code for the neural network presented in the paper: "Structural Language Models of Code" (ICML'2020)

This repository contains the code and models necessary to replicate the results of paper: How to Robustify Black-Box ML Models? A Zeroth-Order Optimization Perspective

Official Implementation of VAT

Related tags

Overview

Semantic correspondence

Few-shot segmentation

Cost Aggregation Is All You Need for Few-Shot Segmentation

Network

Environment Settings

Preparing Few-Shot Segmentation Datasets

1. PASCAL-5i

2. COCO-20i

3. FSS-1000

Training

Evaluation

Acknowledgement

Owner

Hamacojr

CSPML (crystal structure prediction with machine learning-based element substitution)

Multi-Scale Progressive Fusion Network for Single Image Deraining

[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

Imaginaire - NVIDIA's Deep Imagination Team's PyTorch Library

Pi-NAS: Improving Neural Architecture Search by Reducing Supernet Training Consistency Shift (ICCV 2021)

tf2-keras implement yolov5

Voxel Transformer for 3D object detection

"Segmenter: Transformer for Semantic Segmentation" reproduced via mmsegmentation

InsightFace: 2D and 3D Face Analysis Project on MXNet and PyTorch

An open-source online reverse dictionary.

Accompanying code for the paper "A Kernel Test for Causal Association via Noise Contrastive Backdoor Adjustment".

Flask101 - FullStack Web Development with Python & JS - From TAQWA

An implementation of a sequence to sequence neural network using an encoder-decoder

Pytorch domain adaptation package

Digitalizing-Prescription-Image - PIRDS - Prescription Image Recognition and Digitalizing System is a OCR make with Tensorflow

Official implementation for “Unsupervised Low-Light Image Enhancement via Histogram Equalization Prior”

IhoneyBakFileScan Modify - 批量网站备份文件扫描器，增加文件规则，优化内存占用

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

TensorFlow code for the neural network presented in the paper: "Structural Language Models of Code" (ICML'2020)

This repository contains the code and models necessary to replicate the results of paper: How to Robustify Black-Box ML Models? A Zeroth-Order Optimization Perspective

1. PASCAL-5ⁱ

2. COCO-20ⁱ