Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)

Last update: Dec 14, 2022

Overview

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CAC)

Xin Lai^*, Zhuotao Tian^*, Li Jiang, Shu Liu, Hengshuang Zhao, Liwei Wang, Jiaya Jia

This is the official PyTorch implementation of our paper Semi-supervised Semantic Segmentation with Directional Context-aware Consistency that has been accepted to 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021). [Paper]

Highlight

Our method achives the state-of-the-art performance on semi-supervised semantic segmentation.
Based on CCT, this Repository also supports efficient distributed training with multiple GPUs.

Get Started

Environment

The repository is tested on Ubuntu 18.04.3 LTS, Python 3.6.9, PyTorch 1.6.0 and CUDA 10.2

pip install -r requirements.txt

Datasets Preparation

Firstly, download the PASCAL VOC Dataset, and the extra annotations from SegmentationClassAug.
Extract the above compression files into your desired path, and make it follow the directory tree as below.

-VOCtrainval_11-May-2012
    -VOCdevkit
        -VOC2012
            -Annotations
            -ImageSets
            -JPEGImages
            -SegmentationClass
            -SegmentationClassAug
            -SegmentationObject

Set 'data_dir' in the config file into '[YOUR_PATH]/VOCtrainval_11-May-2012'.

Training

Firsly, you should download the PyTorch ResNet101 or ResNet50 ImageNet-pretrained weight, and put it into the 'pretrained/' directory using the following commands.

cd Context-Aware-Consistency
mkdir pretrained
cd pretrained
wget https://download.pytorch.org/models/resnet50-19c8e357.pth # ResNet50
wget https://download.pytorch.org/models/resnet101-5d3b4d8f.pth # ResNet101

Run the following commands for training.

train the model on the 1/8 labeled data (the 0-th data list) of PASCAL VOC with the segmentation network and the backbone set to DeepLabv3+ and ResNet50 respectively.

python3 train.py --config configs/voc_cac_deeplabv3+_resnet50_1over8_datalist0.json

train the model on the 1/8 labeled data (the 0-th data list) of PASCAL VOC with the segmentation network and the backbone set to DeepLabv3+ and ResNet101 respectively.

python3 train.py --config configs/voc_cac_deeplabv3+_resnet101_1over8_datalist0.json

Testing

For testing, run the following command.

python3 train.py --config [CONFIG_PATH] --resume [CHECKPOINT_PATH] --test True

Pre-trained Models

For your convenience, you can download some of the pre-trained models from Here.

Related Repositories

This repository highly depends on the CCT repository at https://github.com/yassouali/CCT. We thank the authors of CCT for their great work and clean code.

Besides, we also borrow some codes from the following repositories.

MoCo at https://github.com/facebookresearch/moco.
Deeplabv3+ at https://github.com/jfzhang95/pytorch-deeplab-xception.
Semseg at https://github.com/hszhao/semseg

Thanks a lot for their great work.

Citation

If you find this project useful, please consider citing:

@inproceedings{lai2021cac,
  title     = {Semi-supervised Semantic Segmentation with Directional Context-aware Consistency},
  author    = {Xin Lai, Zhuotao Tian, Li Jiang, Shu Liu, Hengshuang Zhao, Liwei Wang and Jiaya Jia},
  booktitle = {CVPR},
  year      = {2021}
}

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)

Related tags

Overview

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CAC)

Highlight

Get Started

Environment

Datasets Preparation

Training

Testing

Pre-trained Models

Related Repositories

Citation

Owner

DV Lab

Image classification for projects and researches

[NeurIPS 2021] PyTorch Code for Accelerating Robotic Reinforcement Learning with Parameterized Action Primitives

Official code for the ICCV 2021 paper "DECA: Deep viewpoint-Equivariant human pose estimation using Capsule Autoencoders"

Pixel-Perfect Structure-from-Motion with Featuremetric Refinement (ICCV 2021, Oral)

MVS2D: Efficient Multi-view Stereo via Attention-Driven 2D Convolutions

Official Implementation of VAT

CVNets: A library for training computer vision networks

Yolox-bytetrack-sample - Python sample of MOT (Multiple Object Tracking) using YOLOX and ByteTrack

A transformer-based method for Healthcare Image Captioning in Vietnamese

PyTorch implementation of adversarial patch

A Comprehensive Study on Learning-Based PE Malware Family Classification Methods

AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation

Place holder for HOPE: a human-centric and task-oriented MT evaluation framework using professional post-editing

A keras-based real-time model for medical image segmentation (CFPNet-M)

PyTorch Lightning implementation of Automatic Speech Recognition

Building blocks for uncertainty-aware cycle consistency presented at NeurIPS'21.

Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"

mlpack: a scalable C++ machine learning library --

Learn other languages using artificial intelligence with python.

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)

Related tags

Overview

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CAC)

Highlight

Get Started

Environment

Datasets Preparation

Training

Testing

Pre-trained Models

Related Repositories

Citation

Owner

DV Lab

Image classification for projects and researches

[NeurIPS 2021] PyTorch Code for Accelerating Robotic Reinforcement Learning with Parameterized Action Primitives

Official code for the ICCV 2021 paper "DECA: Deep viewpoint-Equivariant human pose estimation using Capsule Autoencoders"

Pixel-Perfect Structure-from-Motion with Featuremetric Refinement (ICCV 2021, Oral)

MVS2D: Efficient Multi-view Stereo via Attention-Driven 2D Convolutions

Official Implementation of VAT

CVNets: A library for training computer vision networks

Yolox-bytetrack-sample - Python sample of MOT (Multiple Object Tracking) using YOLOX and ByteTrack

A transformer-based method for Healthcare Image Captioning in Vietnamese

PyTorch implementation of adversarial patch

A Comprehensive Study on Learning-Based PE Malware Family Classification Methods

AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation

Place holder for HOPE: a human-centric and task-oriented MT evaluation framework using professional post-editing

A keras-based real-time model for medical image segmentation (CFPNet-M)

PyTorch Lightning implementation of Automatic Speech Recognition

Building blocks for uncertainty-aware cycle consistency presented at NeurIPS'21.

Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"

mlpack: a scalable C++ machine learning library --

Learn other languages ​​using artificial intelligence with python.

Learn other languages using artificial intelligence with python.