ICNet for Real-Time Semantic Segmentation on High-Resolution Images, ECCV2018

Last update: Dec 31, 2022

Related tags

Deep Learning ICNet

Overview

ICNet for Real-Time Semantic Segmentation on High-Resolution Images

by Hengshuang Zhao, Xiaojuan Qi, Xiaoyong Shen, Jianping Shi, Jiaya Jia, details are in project page.

Introduction

Based on PSPNet, this repository is build for evaluation in ICNet. For installation, please follow the description in PSPNet repository (support CUDA 7.0/7.5 + cuDNN v4).

Usage

Clone the repository recursively:

git clone --recursive https://github.com/hszhao/ICNet.git

Build Caffe and matcaffe:

cd $ICNET_ROOT/PSPNet
cp Makefile.config.example Makefile.config
vim Makefile.config
make -j8 && make matcaffe
cd ..

Evaluation mIoU:
- Evaluation code is in folder 'evaluation'.
- Download trained models and put them in folder 'evaluation/model':
  - icnet_cityscapes_train_30k.caffemodel: GoogleDrive
    
    (31M, md5: c7038630c4b6c869afaaadd811bdb539; train on trainset for 30k)
  - icnet_cityscapes_trainval_90k.caffemodel: GoogleDrive
    
    (31M, md5: 4f4dd9eecd465dd8de7e4cf88ba5d5d5; train on trainvalset for 90k)
- Modify the related paths in 'eval_all.m':
  - Mainly variables 'data_root' and 'eval_list', and your image list for evaluation should be similar to that in folder 'evaluation/samplelist' if you use this evaluation code structure.
```
cd evaluation
vim eval_all.m
```
- Run the evaluation scripts:
```
./run.sh
```
Evaluation time:
- To get inference time as accurate as possible, it's suggested to make sure the GPU card with specified ID in script 'test_time.sh' is empty (without other processes executing)
- Run the evaluation scripts:
```
./test_time.sh
```
Results:
- Prediction results will show in folder 'evaluation/mc_result' and the expected scores are:
  - ICNet train on trainset for 30K, evaluated on valset (mIoU/pAcc): 67.7/94.5
  - ICNet train on trainvalset for 90K, evaluated on testset (mIoU): 69.5
- Log information of inference time will be in file 'time.log', approximately 33~36ms on TitanX.
Demo video:
- Video processed by ICNet on cityscapes dataset:
  - Alpha blending with value as 0.5: Video

Citation

If ICNet is useful for your research, please consider citing:

@inproceedings{zhao2018icnet,
  title={ICNet for Real-Time Semantic Segmentation on High-Resolution Images},
  author={Zhao, Hengshuang and Qi, Xiaojuan and Shen, Xiaoyong and Shi, Jianping and Jia, Jiaya},
  booktitle={ECCV},
  year={2018}
}

Questions

Please contact '[email protected]'

ICNet for Real-Time Semantic Segmentation on High-Resolution Images, ECCV2018

Related tags

Overview

ICNet for Real-Time Semantic Segmentation on High-Resolution Images

Introduction

Usage

Citation

Questions

Owner

Hengshuang Zhao

GrailQA: Strongly Generalizable Question Answering

NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch

Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS 2021 Spotlight

This repository contains the code for EMNLP-2021 paper "Word-Level Coreference Resolution"

Adaptive Graph Convolution for Point Cloud Analysis

A fast model to compute optical flow between two input images.

Offline Reinforcement Learning with Implicit Q-Learning

Official repository of the paper 'Essentials for Class Incremental Learning'

The dataset of tweets pulling from Twitters with keyword: Hydroxychloroquine, location: US, Time: 2020

Code for the tech report Toward Training at ImageNet Scale with Differential Privacy

SNE-RoadSeg in PyTorch, ECCV 2020

The PyTorch improved version of TPAMI 2017 paper: Face Alignment in Full Pose Range: A 3D Total Solution.

Interpretable-contrastive-word-mover-s-embedding

Official PyTorch implementation of "ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows"

The code for "Deep Level Set for Box-supervised Instance Segmentation in Aerial Images".

An easy-to-use app to visualise attentions of various VQA models.

"Exploring Vision Transformers for Fine-grained Classification" at CVPRW FGVC8

MERLOT: Multimodal Neural Script Knowledge Models

Recommendationsystem - Movie-recommendation - matrixfactorization colloborative filtering recommendation system user

Implementation of Sequence Generative Adversarial Nets with Policy Gradient