This project aims at providing a concise, easy-to-use, modifiable reference implementation for semantic segmentation models using PyTorch.

Last update: Jan 08, 2023

Overview

Semantic Segmentation on PyTorch

English | 简体中文

This project aims at providing a concise, easy-to-use, modifiable reference implementation for semantic segmentation models using PyTorch.

Installation

# semantic-segmentation-pytorch dependencies
pip install ninja tqdm

# follow PyTorch installation in https://pytorch.org/get-started/locally/
conda install pytorch torchvision -c pytorch

# install PyTorch Segmentation
git clone https://github.com/Tramac/awesome-semantic-segmentation-pytorch.git

Usage

Train

Single GPU training

# for example, train fcn32_vgg16_pascal_voc:
python train.py --model fcn32s --backbone vgg16 --dataset pascal_voc --lr 0.0001 --epochs 50

Multi-GPU training

# for example, train fcn32_vgg16_pascal_voc with 4 GPUs:
export NGPUS=4
python -m torch.distributed.launch --nproc_per_node=$NGPUS train.py --model fcn32s --backbone vgg16 --dataset pascal_voc --lr 0.0001 --epochs 50

Evaluation

Single GPU evaluating

# for example, evaluate fcn32_vgg16_pascal_voc
python eval.py --model fcn32s --backbone vgg16 --dataset pascal_voc

Multi-GPU evaluating

# for example, evaluate fcn32_vgg16_pascal_voc with 4 GPUs:
export NGPUS=4
python -m torch.distributed.launch --nproc_per_node=$NGPUS eval.py --model fcn32s --backbone vgg16 --dataset pascal_voc

Demo

cd ./scripts
#for new users:
python demo.py --model fcn32s_vgg16_voc --input-pic ../tests/test_img.jpg
#you should add 'test.jpg' by yourself
python demo.py --model fcn32s_vgg16_voc --input-pic ../datasets/test.jpg

.{SEG_ROOT}
├── scripts
│   ├── demo.py
│   ├── eval.py
│   └── train.py

Support

Model

DETAILS for model & backbone.

.{SEG_ROOT}
├── core
│   ├── models
│   │   ├── bisenet.py
│   │   ├── danet.py
│   │   ├── deeplabv3.py
│   │   ├── deeplabv3+.py
│   │   ├── denseaspp.py
│   │   ├── dunet.py
│   │   ├── encnet.py
│   │   ├── fcn.py
│   │   ├── pspnet.py
│   │   ├── icnet.py
│   │   ├── enet.py
│   │   ├── ocnet.py
│   │   ├── psanet.py
│   │   ├── cgnet.py
│   │   ├── espnet.py
│   │   ├── lednet.py
│   │   ├── dfanet.py
│   │   ├── ......

Dataset

You can run script to download dataset, such as:

cd ./core/data/downloader
python ade20k.py --download-dir ../datasets/ade

Dataset	training set	validation set	testing set
VOC2012	1464	1449	✘
VOCAug	11355	2857	✘
ADK20K	20210	2000	✘
Cityscapes	2975	500	✘
COCO
SBU-shadow	4085	638	✘
LIP(Look into Person)	30462	10000	10000

.{SEG_ROOT}
├── core
│   ├── data
│   │   ├── dataloader
│   │   │   ├── ade.py
│   │   │   ├── cityscapes.py
│   │   │   ├── mscoco.py
│   │   │   ├── pascal_aug.py
│   │   │   ├── pascal_voc.py
│   │   │   ├── sbu_shadow.py
│   │   └── downloader
│   │       ├── ade20k.py
│   │       ├── cityscapes.py
│   │       ├── mscoco.py
│   │       ├── pascal_voc.py
│   │       └── sbu_shadow.py

Result

PASCAL VOC 2012

Methods	Backbone	TrainSet	EvalSet	crops_size	epochs	JPU	Mean IoU	pixAcc
FCN32s	vgg16	train	val	480	60	✘	47.50	85.39
FCN16s	vgg16	train	val	480	60	✘	49.16	85.98
FCN8s	vgg16	train	val	480	60	✘	48.87	85.02
FCN32s	resnet50	train	val	480	50	✘	54.60	88.57
PSPNet	resnet50	train	val	480	60	✘	63.44	89.78
DeepLabv3	resnet50	train	val	480	60	✘	60.15	88.36

Note: lr=1e-4, batch_size=4, epochs=80.

Overfitting Test

See TEST for details.

.{SEG_ROOT}
├── tests
│   └── test_model.py

This project aims at providing a concise, easy-to-use, modifiable reference implementation for semantic segmentation models using PyTorch.

Related tags

Overview

Semantic Segmentation on PyTorch

Installation

Usage

Train

Evaluation

Demo

Support

Model

Dataset

Result

Overfitting Test

To Do

References

Owner

Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)

Official code repository for "Exploring Neural Models for Query-Focused Summarization"

Official source code of paper 'IterMVS: Iterative Probability Estimation for Efficient Multi-View Stereo'

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Classification of ecg datas for disease detection

Stacs-ci - A set of modules to enable integration of STACS with commonly used CI / CD systems

An architecture that makes any doodle realistic, in any specified style, using VQGAN, CLIP and some basic embedding arithmetics.

Contains source code for the winning solution of the xView3 challenge

A modular, open and non-proprietary toolkit for core robotic functionalities by harnessing deep learning

CLUES: Few-Shot Learning Evaluation in Natural Language Understanding

University of Rochester 2021 Summer REU focusing on music sentiment transfer using CycleGAN

A reimplementation of DCGAN in PyTorch

Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(2021) paper

Receptive Field Block Net for Accurate and Fast Object Detection, ECCV 2018

The Submission for SIMMC 2.0 Challenge 2021

HashNeRF-pytorch - Pure PyTorch Implementation of NVIDIA paper on Instant Training of Neural Graphics primitives

Pytorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)

This is a beginner-friendly repo to make a collection of some unique and awesome projects. Everyone in the community can benefit & get inspired by the amazing projects present over here.

Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian (CVPR 2022)

PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)