Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018

Last update: Dec 19, 2022

Overview

Adversarial Learning for Semi-supervised Semantic Segmentation

This repo is the pytorch implementation of the following paper:

Adversarial Learning for Semi-supervised Semantic Segmentation
Wei-Chih Hung, Yi-Hsuan Tsai, Yan-Ting Liou, Yen-Yu Lin, and Ming-Hsuan Yang
Proceedings of the British Machine Vision Conference (BMVC), 2018.

Contact: Wei-Chih Hung (whung8 at ucmerced dot edu)

The code are heavily borrowed from a pytorch DeepLab implementation (Link). The baseline model is DeepLabv2-Resnet101 without multiscale training and CRF post processing, which yields meanIOU 73.6% on the VOC2012 validation set.

Please cite our paper if you find it useful for your research.

@inproceedings{Hung_semiseg_2018,
  author = {W.-C. Hung and Y.-H. Tsai and Y.-T. Liou and Y.-Y. Lin and M.-H. Yang},
  booktitle = {Proceedings of the British Machine Vision Conference (BMVC)},
  title = {Adversarial Learning for Semi-supervised Semantic Segmentation},
  year = {2018}
}

Prerequisite

CUDA/CUDNN
pytorch >= 0.2 (We only support 0.4 for evaluation. Will migrate the code to 0.4 soon.)
python-opencv >=3.4.0 (3.3 will cause extra GPU memory on multithread data loader)

Installation

Clone this repo

git clone https://github.com/hfslyc/AdvSemiSeg.git

Place VOC2012 dataset in AdvSemiSeg/dataset/VOC2012. For training, you will need the augmented labels (Download). The folder structure should be like:

AdvSemiSeg/dataset/VOC2012/JPEGImages
                          /SegmentationClassAug

Testing on VOC2012 validation set with pretrained models

python evaluate_voc.py --pretrained-model semi0.125 --save-dir results

It will download the pretrained model with 1/8 training data and evaluate on the VOC2012 val set. The colorized images will be saved in results/ and the detailed class IOU will be saved in results/result.txt. The mean IOU should be around 68.8%.

Available --pretrained-model options: semi0.125, semi0.25, semi0.5 , advFull.

Example visualization results

Training on VOC2012

python train.py --snapshot-dir snapshots \
                --partial-data 0.125 \
                --num-steps 20000 \
                --lambda-adv-pred 0.01 \
                --lambda-semi 0.1 --semi-start 5000 --mask-T 0.2

The parameters correspond to those in Table 5 of the paper.

To evaluate trained model, execute the following:

python evaluate_voc.py --restore-from snapshots/VOC_20000.pth \
                       --save-dir results

Changelog

07/24/2018: Update BMVC results

Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018

Related tags

Overview

Adversarial Learning for Semi-supervised Semantic Segmentation

Prerequisite

Installation

Testing on VOC2012 validation set with pretrained models

Example visualization results

Training on VOC2012

Changelog

Owner

Wayne Hung

DeepLM: Large-scale Nonlinear Least Squares on Deep Learning Frameworks using Stochastic Domain Decomposition (CVPR 2021)

Robust Lane Detection via Expanded Self Attention (WACV 2022)

一个目标检测的通用框架(不需要cuda编译)，支持Yolo全系列(v2~v5)、EfficientDet、RetinaNet、Cascade-RCNN等SOTA网络。

Keras implementation of Real-Time Semantic Segmentation on High-Resolution Images

Code for 'Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning' (AAAI 2022)

Code and real data for the paper "Counterfactual Temporal Point Processes", available at arXiv.

TensorFlow implementation of "A Simple Baseline for Bayesian Uncertainty in Deep Learning"

Awesome Weak-Shot Learning

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

ParmeSan: Sanitizer-guided Greybox Fuzzing

Implementation of our paper "Video Playback Rate Perception for Self-supervised Spatio-Temporal Representation Learning".

🎁 3,000,000+ Unsplash images made available for research and machine learning

FedJAX is a library for developing custom Federated Learning (FL) algorithms in JAX.

Python implementation of cover trees, near-drop-in replacement for scipy.spatial.kdtree

Cross-modal Retrieval using Transformer Encoder Reasoning Networks (TERN). With use of Metric Learning and FAISS for fast similarity search on GPU

Integrated Semantic and Phonetic Post-correction for Chinese Speech Recognition

Face Detection & Age Gender & Expression & Recognition

Face uncertainty quantification or estimation using PyTorch.

Teaches a student network from the knowledge obtained via training of a larger teacher network

The PyTorch re-implement of a 3D CNN Tracker to extract coronary artery centerlines with state-of-the-art (SOTA) performance. (paper: 'Coronary artery centerline extraction in cardiac CT angiography using a CNN-based orientation classiﬁer')