Applying PVT to Semantic Segmentation

Last update: Nov 30, 2022

Related tags

Deep Learning PVTv2-Seg

Overview

Applying PVT to Semantic Segmentation

Here, we take MMSegmentation v0.13.0 as an example, applying PVTv2 to SemanticFPN.

For details see Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions.

If you use this code for a paper please cite:

@misc{wang2021pyramid,
      title={Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions}, 
      author={Wenhai Wang and Enze Xie and Xiang Li and Deng-Ping Fan and Kaitao Song and Ding Liang and Tong Lu and Ping Luo and Ling Shao},
      year={2021},
      eprint={2102.12122},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Usage

Install MMSegmentation.

Data preparation

First, prepare ADE20K according to the guidelines in MMSegmentation.

Then, download the weights pretrained on ImageNet at here, and put them in a folder pretrained/

Results and models

Backbone	Iters	mIoU	Config
PVTv2-B0 + Semantic FPN	40K	37.2	config
PVTv2-B1 + Semantic FPN	40K	42.5	config
PVTv2-B2 + Semantic FPN	40K	45.2	config
PVTv2-B3 + Semantic FPN	40K	47.3	config
PVTv2-B4 + Semantic FPN	40K	47.9	config
PVTv2-B5 + Semantic FPN	40K	48.7	config

Evaluation

To evaluate PVTv2-B2 + SemFPN on a single node with 8 gpus run:

dist_test.sh configs/sem_fpn/PVT/fpn_pvtv2_b2_ade20k_40k.py /path/to/checkpoint_file 8 --out results.pkl --eval mIoU

Training

To train PVTv2-B2 + SemFPN on a single node with 8 gpus run:

dist_train.sh configs/sem_fpn/PVT/fpn_pvtv2_b2_ade20k_40k.py 8

License

This repository is released under the Apache 2.0 license as found in the LICENSE file.

Applying PVT to Semantic Segmentation

Related tags

Overview

Applying PVT to Semantic Segmentation

Usage

Data preparation

Results and models

Evaluation

Training

License

Owner

Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(2021) paper

Forecasting directional movements of stock prices for intraday trading using LSTM and random forest

A CNN model to detect hand gestures.

A python library for face detection and features extraction based on mediapipe library

Official Implementation of Neural Splines

Library for time-series-forecasting-as-a-service.

Hough Transform and Hough Line Transform Using OpenCV

Gesture Volume Control Using OpenCV and MediaPipe

A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️

Non-Imaging Transient Reconstruction And TEmporal Search (NITRATES)

Contrastive Learning for Metagenomic Binning

Back to Basics: Efficient Network Compression via IMP

Model-based 3D Hand Reconstruction via Self-Supervised Learning, CVPR2021

SSD: Single Shot MultiBox Detector pytorch implementation focusing on simplicity

Object Detection with YOLOv3

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"

OverFeat is a Convolutional Network-based image classifier and feature extractor.

StarGAN-ZSVC: Unofficial PyTorch Implementation

A bare-bones Python library for quality diversity optimization.