PyTorch implementation of PSPNet

Last update: Nov 16, 2022

Overview

PSPNet with PyTorch

Unofficial implementation of "Pyramid Scene Parsing Network" (https://arxiv.org/abs/1612.01105). This repository is just for caffe to pytorch model conversion and evaluation.

Requirements

pytorch
click
addict
pydensecrf
protobuf

Preparation

Instead of building the author's caffe implementation, you can convert off-the-shelf caffemodels to pytorch models via the caffe.proto.

1. Compile the `caffe.proto` for Python API

This step can be skipped. FYI.
Download the author's caffe.proto into the libs, not the one in the original caffe.

# For protoc command
pip install protobuf
# This generates ./caffe_pb2.py
protoc --python_out=. caffe.proto

2. Model conversion

Find the caffemodels on the author's page (e.g. pspnet50_ADE20K.caffemodel) and store them to the data/models/ directory.
Convert the caffemodels to .pth file.

python convert.py -c <PATH TO YAML>

Demo

python demo.py -c <PATH TO YAML> -i <PATH TO IMAGE>

With a --no-cuda option, this runs on CPU.
With a --crf option, you can perform a CRF postprocessing.

Evaluation

PASCAL VOC2012 only. Please set the dataset path in config/voc12.yaml.

python eval.py -c config/voc12.yaml

88.1% mIoU (SS) and 88.6% mIoU (MS) on validation set.
NOTE: 3 points lower than caffe implementation. WIP

SS: averaged prediction with flipping (2x)
MS: averaged prediction with multi-scaling (6x) and flipping (2x)
Both: No CRF post-processing

References

Official implementation: https://github.com/hszhao/PSPNet
Chainer implementation: https://github.com/mitmul/chainer-pspnet

PyTorch implementation of PSPNet

Related tags

Overview

PSPNet with PyTorch

Requirements

Preparation

1. Compile the `caffe.proto` for Python API

2. Model conversion

Demo

Evaluation

References

Owner

Kazuto Nakashima

Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)

SCALE: Modeling Clothed Humans with a Surface Codec of Articulated Local Elements (CVPR 2021)

Semi-Supervised Graph Prototypical Networks for Hyperspectral Image Classification, IGARSS, 2021.

Code release for The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification (TIP 2020)

Hardware accelerated, batchable and differentiable optimizers in JAX.

Neural Re-rendering for Full-frame Video Stabilization

Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR 2018).

A SAT-based sudoku solver

Reinforcement Learning via Supervised Learning

A port of muP to JAX/Haiku

Best Practices on Recommendation Systems

基于YoloX目标检测+DeepSort算法实现多目标追踪Baseline

CNN Based Meta-Learning for Noisy Image Classification and Template Matching

CPU inference engine that delivers unprecedented performance for sparse models

Delta Conformity Sociopatterns Analysis - Delta Conformity Sociopatterns Analysis

Gif-caption - A straightforward GIF Captioner written in Python

2021搜狐校园文本匹配算法大赛分比我们低的都是帅哥队

Hidden-Fold Networks (HFN): Random Recurrent Residuals Using Sparse Supermasks

A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution

[ICRA2021] Reconstructing Interactive 3D Scene by Panoptic Mapping and CAD Model Alignment

PyTorch implementation of PSPNet

Related tags

Overview

PSPNet with PyTorch

Requirements

Preparation

1. Compile the caffe.proto for Python API

2. Model conversion

Demo

Evaluation

References

Owner

Kazuto Nakashima

Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)

SCALE: Modeling Clothed Humans with a Surface Codec of Articulated Local Elements (CVPR 2021)

Semi-Supervised Graph Prototypical Networks for Hyperspectral Image Classification, IGARSS, 2021.

Code release for The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification (TIP 2020)

Hardware accelerated, batchable and differentiable optimizers in JAX.

Neural Re-rendering for Full-frame Video Stabilization

Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR 2018).

A SAT-based sudoku solver

Reinforcement Learning via Supervised Learning

A port of muP to JAX/Haiku

Best Practices on Recommendation Systems

基于YoloX目标检测+DeepSort算法实现多目标追踪Baseline

CNN Based Meta-Learning for Noisy Image Classification and Template Matching

CPU inference engine that delivers unprecedented performance for sparse models

Delta Conformity Sociopatterns Analysis - Delta Conformity Sociopatterns Analysis

Gif-caption - A straightforward GIF Captioner written in Python

2021搜狐校园文本匹配算法大赛 分比我们低的都是帅哥队

Hidden-Fold Networks (HFN): Random Recurrent Residuals Using Sparse Supermasks

A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution

[ICRA2021] Reconstructing Interactive 3D Scene by Panoptic Mapping and CAD Model Alignment

1. Compile the `caffe.proto` for Python API

2021搜狐校园文本匹配算法大赛分比我们低的都是帅哥队