ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Last update: Jan 03, 2023

Overview

ST++

This is the official PyTorch implementation of our paper:

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation.
Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi and Yang Gao.

Getting Started

Data Preparation

Pre-trained Model

ResNet-50 | ResNet-101 | DeepLabv2-ResNet-101

Dataset

Pascal | Augmented Masks | Cityscapes | Class Mapped Masks

File Organization

├── ./pretrained
    ├── resnet50.pth
    ├── resnet101.pth
    └── deeplabv2_resnet101_coco_pretrained.pth
    
├── [Your Pascal Path]
    ├── JPEGImages
    └── SegmentationClass    # replace the official folder with the augmented masks above
    
├── [Your Cityscapes Path]
    ├── gtFine               # replace the official folder with the class-mapped masks above
    └── leftImg8bit

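Before launching training, it can help to confirm that the folders and files listed above are in place. The following is a minimal sketch, not part of the repository: the helper names `missing_entries` and `check_layout` are hypothetical, and the root paths are placeholders for your own locations. It only checks that each entry exists; it cannot tell whether the masks were actually replaced.

```python
from pathlib import Path

# Entry names copied from the layout above. Everything else in this block
# (function names, arguments) is hypothetical and not part of the repository.
PRETRAINED_FILES = (
    "resnet50.pth",
    "resnet101.pth",
    "deeplabv2_resnet101_coco_pretrained.pth",
)
PASCAL_DIRS = ("JPEGImages", "SegmentationClass")
CITYSCAPES_DIRS = ("gtFine", "leftImg8bit")


def missing_entries(root, names):
    """Return the paths under `root` that are listed in `names` but do not exist."""
    root = Path(root)
    return [str(root / name) for name in names if not (root / name).exists()]


def check_layout(pretrained_root="./pretrained", pascal_root=None, cityscapes_root=None):
    """Collect every expected file or folder that is missing.

    Dataset roots left as None are skipped, so you can check only the
    dataset you intend to train on.
    """
    missing = missing_entries(pretrained_root, PRETRAINED_FILES)
    if pascal_root is not None:
        missing += missing_entries(pascal_root, PASCAL_DIRS)
    if cityscapes_root is not None:
        missing += missing_entries(cityscapes_root, CITYSCAPES_DIRS)
    return missing
```

An empty list from `check_layout(pascal_root="[Your Pascal Path]")` means every expected entry was found; otherwise the list names what is missing.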
Training and Testing

export semi_setting='pascal/1_8/split_0'

CUDA_VISIBLE_DEVICES=0,1 python -W ignore main.py \
  --dataset pascal --data-root [Your Pascal Path] \
  --batch-size 16 --backbone resnet50 --model deeplabv3plus \
  --labeled-id-path dataset/splits/$semi_setting/labeled.txt \
  --unlabeled-id-path dataset/splits/$semi_setting/unlabeled.txt \
  --pseudo-mask-path outdir/pseudo_masks/$semi_setting \
  --save-path outdir/models/$semi_setting

The command above runs our plain ST framework. To run ST++, append --plus --reliable-id-path outdir/reliable_ids/$semi_setting to it.
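Every path in the training command is derived from the single semi_setting value, and ST++ differs from ST only by the two extra flags. The sketch below makes that explicit by assembling the argument list in Python. The function `build_command` is hypothetical and not part of the repository; the flags and path patterns are copied from the command above, and the data root is a placeholder.

```python
import shlex


def build_command(semi_setting, data_root, plus=False, dataset="pascal",
                  backbone="resnet50", model="deeplabv3plus", batch_size=16):
    """Build the main.py argument list for ST (plus=False) or ST++ (plus=True).

    Hypothetical helper: it only mirrors the flags shown in this README.
    """
    args = [
        "python", "-W", "ignore", "main.py",
        "--dataset", dataset, "--data-root", data_root,
        "--batch-size", str(batch_size),
        "--backbone", backbone, "--model", model,
        "--labeled-id-path", f"dataset/splits/{semi_setting}/labeled.txt",
        "--unlabeled-id-path", f"dataset/splits/{semi_setting}/unlabeled.txt",
        "--pseudo-mask-path", f"outdir/pseudo_masks/{semi_setting}",
        "--save-path", f"outdir/models/{semi_setting}",
    ]
    if plus:
        # The only difference between ST and ST++ on the command line.
        args += ["--plus", "--reliable-id-path", f"outdir/reliable_ids/{semi_setting}"]
    return args


if __name__ == "__main__":
    # "/path/to/pascal" is a placeholder for [Your Pascal Path].
    command = build_command("pascal/1_8/split_0", "/path/to/pascal", plus=True)
    print(shlex.join(command))
```

The printed line can be pasted into a shell, prefixed with CUDA_VISIBLE_DEVICES=0,1 as in the command above. Changing the first argument to another split is enough to retarget all five paths at once.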

Acknowledgement

The DeepLabv2 MS COCO pre-trained model is borrowed and converted from AdvSemiSeg. The image partitions are borrowed from Context-Aware-Consistency and PseudoSeg. Part of the training hyper-parameters and network structures are adapted from PyTorch-Encoding. The strong data augmentations are borrowed from MoCo v2 and PseudoSeg.

AdvSemiSeg: https://github.com/hfslyc/AdvSemiSeg
Context-Aware-Consistency: https://github.com/dvlab-research/Context-Aware-Consistency
PseudoSeg: https://github.com/googleinterns/wss
PyTorch-Encoding: https://github.com/zhanghang1989/PyTorch-Encoding
MoCo: https://github.com/facebookresearch/moco
OpenSelfSup: https://github.com/open-mmlab/OpenSelfSup

Many thanks to the authors of these projects for their great work!

Citation

If you find this project useful, please consider citing:

@article{yang2021st++,
  title={ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation},
  author={Yang, Lihe and Zhuo, Wei and Qi, Lei and Shi, Yinghuan and Gao, Yang},
  journal={arXiv preprint arXiv:2106.05095},
  year={2021}
}
