Per-Pixel Classification is Not All You Need for Semantic Segmentation

Last update: Jan 08, 2023

Overview

MaskFormer: Per-Pixel Classification is Not All You Need for Semantic Segmentation

Bowen Cheng, Alexander G. Schwing, Alexander Kirillov

[arXiv] [Project] [BibTeX]

Features

Better results while being more efficient.
Unified view of semantic- and instance-level segmentation tasks.
Supports major semantic segmentation datasets: ADE20K, Cityscapes, COCO-Stuff, Mapillary Vistas.
Supports all Detectron2 models.
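
The "unified view" rests on mask classification: rather than labeling each pixel directly, the model predicts a set of N (class, mask) pairs, and a semantic map is obtained by weighting each mask with its class probabilities and taking the per-pixel argmax. The sketch below illustrates that inference step as described in the paper. It is not the repository's code: the function names (softmax, sigmoid, semantic_inference) are hypothetical, and it uses plain Python lists in place of tensors so that it runs without dependencies.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    """Logistic function mapping a mask logit to a value in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def semantic_inference(class_logits, mask_logits):
    """Combine N (class, mask) predictions into a per-pixel label map.

    class_logits: N lists of K + 1 scores; the last entry is "no object".
    mask_logits:  N masks, each an H x W nested list of scores.
    Returns an H x W nested list of class indices in [0, K).
    """
    num_classes = len(class_logits[0]) - 1
    # Per-query class probabilities, with the "no object" entry dropped.
    class_probs = [softmax(row)[:num_classes] for row in class_logits]
    height, width = len(mask_logits[0]), len(mask_logits[0][0])

    labels = []
    for y in range(height):
        row = []
        for x in range(width):
            # Score of class c at this pixel:
            # sum over queries i of p_i(c) * m_i[y][x].
            scores = [
                sum(p[c] * sigmoid(m[y][x])
                    for p, m in zip(class_probs, mask_logits))
                for c in range(num_classes)
            ]
            row.append(max(range(num_classes), key=scores.__getitem__))
        labels.append(row)
    return labels


# Toy input (made up for illustration): 2 queries, 2 classes plus
# "no object", and a 2 x 2 image.
class_logits = [
    [5.0, 0.0, 0.0],  # query 0 is confident about class 0
    [0.0, 5.0, 0.0],  # query 1 is confident about class 1
]
mask_logits = [
    [[8.0, 8.0], [-8.0, -8.0]],  # query 0 covers the top row
    [[-8.0, -8.0], [8.0, 8.0]],  # query 1 covers the bottom row
]
label_map = semantic_inference(class_logits, mask_logits)
print(label_map)  # [[0, 0], [1, 1]]
```

Because each prediction is a whole mask with a single class label, the same set of outputs can also be read as individual segments, which is what allows one model to serve both semantic- and instance-level tasks.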

Installation

See installation instructions.

Getting Started

See Preparing Datasets for MaskFormer.

See Getting Started with MaskFormer.

Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the MaskFormer Model Zoo.

License

The majority of MaskFormer is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However, portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license.

Citing MaskFormer

If you use MaskFormer in your research or wish to refer to the baseline results published in the Model Zoo, please use the following BibTeX entry.

@article{cheng2021maskformer,
  title={Per-Pixel Classification is Not All You Need for Semantic Segmentation},
  author={Bowen Cheng and Alexander G. Schwing and Alexander Kirillov},
  journal={arXiv},
  year={2021}
}

Owner

Facebook Research
