FastFCN: Rethinking Dilated Convolution in the Backbone for Semantic Segmentation.

Last update: Dec 29, 2022

Official implementation of FastFCN: Rethinking Dilated Convolution in the Backbone for Semantic Segmentation.
A faster, stronger, and lighter framework for semantic segmentation, achieving state-of-the-art performance with more than 3x acceleration.

@inproceedings{wu2019fastfcn,
  title     = {FastFCN: Rethinking Dilated Convolution in the Backbone for Semantic Segmentation},
  author    = {Wu, Huikai and Zhang, Junge and Huang, Kaiqi and Liang, Kongming and Yu, Yizhou},
  booktitle = {arXiv preprint arXiv:1903.11816},
  year = {2019}
}

Contact: Hui-Kai Wu ([email protected])

Update

2020-04-15: Inference on a single image is now supported.

CUDA_VISIBLE_DEVICES=0,1,2,3 python -m experiments.segmentation.test_single_image --dataset [pcontext|ade20k] \
    --model [encnet|deeplab|psp] --jpu [JPU|JPU_X] \
    --backbone [resnet50|resnet101] [--ms] --resume {MODEL} --input-path {INPUT} --save-path {OUTPUT}
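For batch or scripted use, the command above can also be assembled programmatically. The sketch below is only an illustration: the helper name `build_single_image_command` and the example file names are hypothetical, and it assumes the flags and choices are exactly those shown in the usage line.

```python
import shlex

# Choices copied from the usage line above; everything else is illustrative.
CHOICES = {
    "--dataset": ("pcontext", "ade20k"),
    "--model": ("encnet", "deeplab", "psp"),
    "--jpu": ("JPU", "JPU_X"),
    "--backbone": ("resnet50", "resnet101"),
}


def build_single_image_command(dataset, model, jpu, backbone,
                               resume, input_path, save_path,
                               multi_scale=False):
    """Return the argument list for the single-image test module (hypothetical helper)."""
    selected = {"--dataset": dataset, "--model": model,
                "--jpu": jpu, "--backbone": backbone}
    cmd = ["python", "-m", "experiments.segmentation.test_single_image"]
    for flag, value in selected.items():
        # Reject values that the usage line does not list.
        if value not in CHOICES[flag]:
            raise ValueError(f"{flag} must be one of {CHOICES[flag]}, got {value!r}")
        cmd += [flag, value]
    if multi_scale:
        cmd.append("--ms")
    cmd += ["--resume", resume, "--input-path", input_path, "--save-path", save_path]
    return cmd


# Placeholder paths; replace them with a real checkpoint and image.
example = build_single_image_command(
    "pcontext", "encnet", "JPU", "resnet50",
    resume="model.pth.tar", input_path="input.jpg", save_path="output.png",
    multi_scale=True,
)
print(" ".join(shlex.quote(part) for part in example))
```

The resulting list can be passed to `subprocess.run`, with `CUDA_VISIBLE_DEVICES` set in the environment as in the shell version.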

2020-04-15: A new joint upsampling module is now available.

--jpu [JPU|JPU_X]: JPU is the original module in the arXiv paper; JPU_X is a pyramid version of JPU.

2020-02-20: FastFCN can now run on every OS with PyTorch>=1.1.0 and Python==3.*.*

All C/C++ extensions have been replaced with pure Python extensions.

Version

Original code, producing the results reported in the arXiv paper. [branch:v1.0.0]
Pure PyTorch code, with torch.nn.parallel.DistributedDataParallel and torch.nn.SyncBatchNorm. [branch:latest]
Pure Python code. [branch:master]

Overview

Framework

Joint Pyramid Upsampling (JPU)

Install

PyTorch >= 1.1.0 (Note: the code is tested in an environment with python=3.6, cuda=9.0)

Download FastFCN

git clone https://github.com/wuhuikai/FastFCN.git
cd FastFCN

Install Requirements
```
nose
tqdm
scipy
cython
requests
```

Train and Test

PContext

python -m scripts.prepare_pcontext

| Method | Backbone | mIoU | FPS | Model | Scripts |
|---|---|---|---|---|---|
| EncNet | ResNet-50 | 49.91 | 18.77 | | |
| EncNet+JPU (ours) | ResNet-50 | 51.05 | 37.56 | GoogleDrive | bash |
| PSP | ResNet-50 | 50.58 | 18.08 | | |
| PSP+JPU (ours) | ResNet-50 | 50.89 | 28.48 | GoogleDrive | bash |
| DeepLabV3 | ResNet-50 | 49.19 | 15.99 | | |
| DeepLabV3+JPU (ours) | ResNet-50 | 50.07 | 20.67 | GoogleDrive | bash |
| EncNet | ResNet-101 | 52.60 (MS) | 10.51 | | |
| EncNet+JPU (ours) | ResNet-101 | 54.03 (MS) | 32.02 | GoogleDrive | bash |
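To relate the FPS column to the acceleration claim, the speedup of each JPU variant over its baseline can be computed directly from the table. This is simple arithmetic on the numbers listed above; the dictionary layout and the function name `speedup` are choices made for this illustration.

```python
# (baseline FPS, +JPU FPS) pairs copied from the PContext table above.
FPS = {
    ("EncNet", "ResNet-50"): (18.77, 37.56),
    ("PSP", "ResNet-50"): (18.08, 28.48),
    ("DeepLabV3", "ResNet-50"): (15.99, 20.67),
    ("EncNet", "ResNet-101"): (10.51, 32.02),
}


def speedup(baseline_fps, jpu_fps):
    """Ratio of frames per second with JPU to frames per second without it."""
    return jpu_fps / baseline_fps


speedups = {key: speedup(*pair) for key, pair in FPS.items()}
for (method, backbone), ratio in speedups.items():
    print(f"{method} ({backbone}): {ratio:.2f}x")
```

On these numbers, EncNet with ResNet-101 is the pair that reaches roughly 3x; the ResNet-50 pairs range from about 1.3x to 2x.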

ADE20K

python -m scripts.prepare_ade20k

Training Set

| Method | Backbone | mIoU (MS) | Model | Scripts |
|---|---|---|---|---|
| EncNet | ResNet-50 | 41.11 | | |
| EncNet+JPU (ours) | ResNet-50 | 42.75 | GoogleDrive | bash |
| EncNet | ResNet-101 | 44.65 | | |
| EncNet+JPU (ours) | ResNet-101 | 44.34 | GoogleDrive | bash |

Training Set + Val Set

| Method | Backbone | FinalScore (MS) | Model | Scripts |
|---|---|---|---|---|
| EncNet+JPU (ours) | ResNet-50 | | GoogleDrive | bash |
| EncNet | ResNet-101 | 55.67 | | |
| EncNet+JPU (ours) | ResNet-101 | 55.84 | GoogleDrive | bash |

Note: EncNet (ResNet-101) is trained with crop_size=576, while EncNet+JPU (ResNet-101) is trained with crop_size=480 for fitting 4 images into a 12G GPU.
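As a back-of-the-envelope check on the note above, the two crop sizes can be compared by pixel count. This assumes square crops and that activation memory grows roughly in proportion to the number of pixels, which is a simplification and not a measurement from the repository.

```python
def crop_pixels(crop_size):
    """Number of pixels in a square crop with the given side length."""
    return crop_size * crop_size


# Ratio of pixels per training crop: crop_size=576 versus crop_size=480.
ratio = crop_pixels(576) / crop_pixels(480)
print(f"{ratio:.2f}")
```

Under that assumption, a 576 crop costs about 1.44 times as much per image as a 480 crop, so the two ResNet-101 rows were not trained under identical conditions.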

Visual Results

Figure: qualitative comparison on PContext and ADE20K, with columns Input, GT, EncNet, and Ours.

More Visual Results

Acknowledgement

Code borrows heavily from PyTorch-Encoding.

Comments

Problems when running test.py and train.py

Hi, I am a beginner in deep learning, and some problems occurred when I was running the code. First, I used the command "tar -xvf encnet_jpu_res50_pcontext.pth.tar" to extract the tar file, but it failed. Second, if I do manage to extract the file and get the checkpoint, where should I put it? Thank you!

opened by pp00704831 18
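A likely explanation, offered as an assumption and not as the maintainers' answer: files with the `.pth.tar` suffix are conventionally written with `torch.save`, so they are not tar archives and `tar -xvf` has nothing to extract. Such a file would normally be passed unchanged to `--resume`. The standard-library check below shows how to tell the two cases apart without PyTorch installed; the helper name `is_real_tar_archive` and the stand-in file are hypothetical.

```python
import os
import pickle
import tarfile
import tempfile


def is_real_tar_archive(path):
    """Return True only if the file at `path` is an actual tar archive."""
    return tarfile.is_tarfile(path)


# Stand-in for a checkpoint: pickled bytes stored under a .pth.tar name.
with tempfile.TemporaryDirectory() as tmp:
    fake_checkpoint = os.path.join(tmp, "example.pth.tar")
    with open(fake_checkpoint, "wb") as handle:
        pickle.dump({"state_dict": {}}, handle)
    result = is_real_tar_archive(fake_checkpoint)

print(result)  # False: the suffix alone does not make the file a tar archive
```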

Why can I still train the model after removing JPU?

Why does the code still execute without error when I delete the JPU module (/FastFCN/encoding/nn/customize.py)? I can still train the model. This is my command (I did load the JPU module):

CUDA_VISIBLE_DEVICES=4,5,6,7 python train.py --dataset pcontext --model encnet --jpu --aux --se-loss --backbone resnet101 --checkname encnet_res101_pcontext

opened by E18301194 17

Segmentation fault

I think this problem is caused by an earlier problem with my PyTorch installation, so I may have to fix PyTorch first. Could you give me some help? My setup is gcc 4.8, PyTorch 1.1.0, Python 3.5. Also, how can I change the PyTorch version to 1.0.0? With pip install torch==1.0?

opened by Anikily 12
Performance Issue

Thanks for your work. I have tried this script: https://github.com/wuhuikai/FastFCN/blob/master/experiments/segmentation/scripts/encnet_res50_pcontext.sh with the following hardware and software: 4x Titan Xp, Ubuntu 16.04, CUDA 9.0, PyTorch 1.0.

But I can't reproduce the performance reported in your paper. I got pixAcc: 0.7747, mIoU: 0.4785 for single-scale, and pixAcc: 0.7833, mIoU: 0.4898 for multi-scale.

I would appreciate your help. Thanks for your consideration.

opened by tonysy 12

FastFCN is now supported by MMSegmentation.

Hi, FastFCN is now supported by MMSegmentation. We find that using JPU with smaller feature maps from the backbone achieves similar or higher performance than the original models with larger feature maps.

There is still work left for us: for example, we did not observe an obvious FPS improvement in our implementation, so we will try to figure that out in the future.

Anyway, thanks for your work, and we hope more people from the community can use FastFCN.

Best,

opened by MengzhangLI 9
RuntimeError: Failed downloading

Hi, thanks for your work. I tried to run your code to train a model on the PASCAL Context dataset, but I got the following error: RuntimeError: Failed downloading url https://hangzh.s3.amazonaws.com/encoding/models/resnet50-ebb6acbb.zip The problem is that I cannot download the pretrained model. It seems the author no longer provides the pretrained ResNet model: https://github.com/zhanghang1989/PyTorch-Encoding/issues/273

So, how can I solve this problem? Thanks for your consideration.

opened by bufferXia 9
How could I set "resume" while running test_single_image?

Hello!

When I ran test_single_image.py, I set resume to the path of resnet101-2a57e44d.pth and encountered an error:

File "G:/gitfolder/FastFCN/experiments/segmentation/test_single_image.py", line 43, in test
    model.load_state_dict(checkpoint['state_dict'], strict=False)
KeyError: 'state_dict'

I suspect there is a problem with "resume". Waiting for your reply.

Thank you!

opened by CN-HaoJiang 8
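One possible reading of this error, stated as an assumption: `resnet101-2a57e44d.pth` appears to be a backbone weight file, that is, a bare mapping from parameter names to tensors, whereas `--resume` expects a full checkpoint that stores the weights under a `'state_dict'` key. The sketch below uses plain dictionaries as stand-ins for the objects returned by `torch.load`; `unwrap_state_dict` and the example keys are hypothetical and not part of the repository.

```python
def unwrap_state_dict(checkpoint):
    """Return the parameter mapping from either checkpoint layout."""
    if not isinstance(checkpoint, dict):
        raise TypeError(f"expected a dict-like checkpoint, got {type(checkpoint).__name__}")
    # Full checkpoints wrap the weights; bare weight files are the mapping itself.
    if "state_dict" in checkpoint:
        return checkpoint["state_dict"]
    return checkpoint


# Illustrative stand-ins; real files would hold tensors instead of lists.
full_checkpoint = {"epoch": 1, "state_dict": {"head.weight": [0.1]}}
bare_weights = {"conv1.weight": [0.2]}

print(sorted(unwrap_state_dict(full_checkpoint)))  # ['head.weight']
print(sorted(unwrap_state_dict(bare_weights)))     # ['conv1.weight']
```

Unwrapping only avoids the `KeyError`. A backbone-only file would still lack the segmentation head weights, so passing one of the released `.pth.tar` models to `--resume` is probably the intended usage.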
Questions about the SE-loss and Aux-loss

Hi, first of all, thank you for the great work. I checked the code and ran some of the scripts. I am confused by the final loss, which is composed of three individual losses. Could you explain what the SE-loss and the aux-loss are used for?

opened by meanmee 7
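As a rough orientation, based on the EncNet design that this code builds on and not on a statement from the maintainers: the aux-loss is a second segmentation loss computed from an intermediate backbone feature map, and the SE-loss asks the network to predict which classes are present anywhere in the image. The three terms are then combined as a weighted sum. The sketch below is plain Python. The function names, the weight values of 0.2, and the ignore label of -1 are assumptions made for illustration.

```python
def se_target(label_map, num_classes, ignore_index=-1):
    """Build the per-image class-presence vector that an SE-style loss predicts."""
    present = [0.0] * num_classes
    for row in label_map:
        for label in row:
            # Skip ignored pixels and anything outside the class range.
            if label != ignore_index and 0 <= label < num_classes:
                present[label] = 1.0
    return present


def total_loss(seg_loss, aux_loss, se_loss, aux_weight=0.2, se_weight=0.2):
    """Weighted sum of the main, auxiliary, and SE losses (weights are assumed)."""
    return seg_loss + aux_weight * aux_loss + se_weight * se_loss


# A tiny 2x3 label map containing classes 0 and 2 plus one ignored pixel.
label_map = [[0, 0, 2], [0, -1, 2]]
print(se_target(label_map, num_classes=4))  # [1.0, 0.0, 1.0, 0.0]
print(f"{total_loss(1.0, 0.5, 0.25):.2f}")
```

Both extra terms act only as training signals; at inference time only the main segmentation output is used.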
Backbone weights download links not working anymore

Download links for the backbone do not seem to work anymore.

I've tested with Resnet50 (https://hangzh.s3.amazonaws.com/encoding/models/resnet50-ebb6acbb.zip) and Resnet 101 (https://hangzh.s3.amazonaws.com/encoding/models/resnet101-2a57e44d.zip) too.

I also tried to use torchvision weights instead, but I got key-matching errors when trying to load them.

Could you consider reuploading the weights? That would be very helpful!

opened by Khroto 6
Segmentation Fault

I ran the following command to train the model, but a segmentation fault occurred. Has anyone else run into this problem? Thanks for your help!

Command:
CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py --dataset pcontext --model encnet --jpu --aux --se-loss --backbone resnet101 --checkname encnet_res101_pcontext

Output before the crash:
Using poly LR Scheduler!
Starting Epoch: 0
Total Epoches: 80
0%| | 0/312 [00:00<?, ?it/s]
=>Epoches 0, learning rate = 0.0010, previous best = 0.0000
Segmentation fault

Hardware: 4 x Nvidia Tesla P100-PCIE 16G GPUs, 18 x GenuineIntel CPUs, 140G of memory in total.

opened by SimonTsungHanKuo 6

Need your suggestions

Hi, I have designed this SPP module for my network, but I am also interested in replacing it with the JPU from your work. Could you give me any suggestions? Here is my implementation:

import torch.nn as nn
import torch.nn.functional as F


class SPP(nn.Module):
    def __init__(self, pool_sizes):
        super().__init__()
        self.pool_sizes = pool_sizes

    def forward(self, x):
        h, w = x.shape[2:]
        spp_sum = x
        for pool_size in self.pool_sizes:
            # Kernel and stride are equal, so the pooling windows do not overlap.
            kernel = (h // pool_size, w // pool_size)
            out = F.avg_pool2d(x, kernel, stride=kernel, padding=0)
            # F.upsample is deprecated; F.interpolate is its replacement.
            out = F.interpolate(out, size=(h, w), mode="bilinear", align_corners=False)
            # Accumulate each pooled-and-resized map onto the input features.
            spp_sum = spp_sum + out
        return spp_sum

opened by haideralimughal 5

Add ResNeSt and Xception65

This copies ResNeSt and Xception65 from PyTorch-Encoding. Xception65 can only be used without pretrained models.

Please be careful, as there are many changes!

I tested it on my own server, and everything seems OK. As a precaution, you may want to test it yourself first. My FastFCN

I did not change Readme.md or the *.sh scripts. You can update them if you accept this request.

If server resources are not tight, I will run encnet+jpu+resnest101+pcontext and encnet+jpu_x+resnest101+pcontext, and then share the results in an issue or open another pull request for Readme.md with my pth.tar.

Thanks for your work again.

opened by tjj1998 1

Releases(v1.0.0)

v1.0.0(Feb 13, 2020)

Source code(tar.gz)
Source code(zip)

Owner

Wu Huikai

GitHub Repository http://wuhuikai.me/FastFCNProject

