TensorFlow implementation of original paper : https://github.com/hszhao/PSPNet

Last update: Dec 29, 2022

Overview

Keras implementation of PSPNet(caffe)

Implemented Architecture of Pyramid Scene Parsing Network in Keras.

For the best compability please use Python3.5

Setup

Install dependencies:
- Tensorflow (-gpu)
- Keras
- numpy
- scipy
- pycaffe(PSPNet)(optional for converting the weights)
```
pip install -r requirements.txt --upgrade
```
Converted trained weights are needed to run the network. Weights(in .h5 .json format) have to be downloaded and placed into directory weights/keras

Already converted weights can be downloaded here:

Convert weights by yourself(optional)

(Note: this is not required if you use .h5/.json weights)

Running this needs the compiled original PSPNet caffe code and pycaffe.

python weight_converter.py <path to .prototxt> <path to .caffemodel>

Usage:

python pspnet.py -m <model> -i <input_image>  -o <output_path>
python pspnet.py -m pspnet101_cityscapes -i example_images/cityscapes.png -o example_results/cityscapes.jpg
python pspnet.py -m pspnet101_voc2012 -i example_images/pascal_voc.jpg -o example_results/pascal_voc.jpg

List of arguments:

 -m --model        - which model to use: 'pspnet50_ade20k', 'pspnet101_cityscapes', 'pspnet101_voc2012'
    --id           - (int) GPU Device id. Default 0
 -s --sliding      - Use sliding window
 -f --flip         - Additional prediction of flipped image
 -ms --multi_scale - Predict on multiscale images

Keras results:

Implementation details

The interpolation layer is implemented as custom layer "Interp"
Forward step takes about ~1 sec on single image

Memory usage can be optimized with:

config = tf.ConfigProto()
config.gpu_options.per_process_gpu_memory_fraction = 0.3 
sess = tf.Session(config=config)

ndimage.zoom can take a long time

TensorFlow implementation of original paper : https://github.com/hszhao/PSPNet

Related tags

Overview

Keras implementation of PSPNet(caffe)

Setup

Convert weights by yourself(optional)

Usage:

Keras results:

Implementation details

Owner

VladKry

Differentiable rasterization applied to 3D model simplification tasks

We present a regularized self-labeling approach to improve the generalization and robustness properties of fine-tuning.

Diagnostic tests for linguistic capacities in language models

[IROS'21] SurRoL: An Open-source Reinforcement Learning Centered and dVRK Compatible Platform for Surgical Robot Learning

deep learning model that learns to code with drawing in the Processing language

A Python 3 package for state-of-the-art statistical dimension reduction methods

SAMO: Streaming Architecture Mapping Optimisation

GoodNews Everyone! Context driven entity aware captioning for news images

PyTorch implementation for "Sharpness-aware Quantization for Deep Neural Networks".

"NAS-Bench-301 and the Case for Surrogate Benchmarks for Neural Architecture Search".

This repository contains the source code of our work on designing efficient CNNs for computer vision

Focal Loss for Dense Rotation Object Detection

Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation

A PyTorch implementation of "Capsule Graph Neural Network" (ICLR 2019).

Old Photo Restoration (Official PyTorch Implementation)

CLIP+FFT text-to-image

Project repo for Learning Category-Specific Mesh Reconstruction from Image Collections

Official implementation of Rethinking Graph Neural Architecture Search from Message-passing (CVPR2021)

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, wav2lip, picture repair, image editing, photo2cartoon, image style transfer, and so on.

B-cos Networks: Attention is All we Need for Interpretability