"Segmenter: Transformer for Semantic Segmentation" reproduced via mmsegmentation

Last update: Feb 24, 2022

Overview

Segmenter-based-on-OpenMMLab

"Segmenter: Transformer for Semantic Segmentation, arxiv 2105.05633." reproduced via mmsegmentation.

We reproduce Segmenter via mmsegmentation based on official open-sourced code.

Environment

python=3.7
pytorch=1.7.1
torchvision=0.8.2
cudatoolkit=10.1
mmcv-full=1.3.10
mmsegmentation=0.16.0

Note: You should install pytorch with a version higher than 1.7, because the pretrained model of DeiT is saved via 1.7+ pytorch. Otherwise you may encounter some errors while loading the state_dict.

Results on ADE20K

The passwds of download links are all 'nopw'.

Exp	Name	backbone	Our mIoU-SS	mIoU in paper	Resolution	BS	Download
4th line in Table3	Seg-B†-Linear/16	DeiT-B	46.83	47.10	512x512	8	model	config	log
4th line in Table6	Seg-B†-Mask/16	DeiT-B	48.41	47.67	512x512	8	model	config	log
6th line in Table3	Seg-B -Linear/16	ViT-B	45.70	45.69	512x512	8	model	config	log

Owner

EricKani

GitHub Repository

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning. CVPR 2018

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning Tensorflow code and models for the paper: Large Scale Fine-Grained Categ

187 Oct 01, 2022

Stochastic Tensor Optimization for Robot Motion - A GPU Robot Motion Toolkit

STORM Stochastic Tensor Optimization for Robot Motion - A GPU Robot Motion Toolkit [Install Instructions] [Paper] [Website] This package contains code

101 Dec 12, 2022

Reverse engineer your pytorch vision models, in style

🔍 Rover Reverse engineer your CNNs, in style Rover will help you break down your CNN and visualize the features from within the model. No need to wri

32 Sep 24, 2022

Human pose estimation from video plays a critical role in various applications such as quantifying physical exercises, sign language recognition, and full-body gesture control.

Pose Detection Project Description: Human pose estimation from video plays a critical role in various applications such as quantifying physical exerci

2 Jan 17, 2022

Repository for "Toward Practical Monocular Indoor Depth Estimation" (CVPR 2022)

Toward Practical Monocular Indoor Depth Estimation Cho-Ying Wu, Jialiang Wang, Michael Hall, Ulrich Neumann, Shuochen Su [arXiv] [project site] DistDe

122 Dec 13, 2022

Relative Positional Encoding for Transformers with Linear Complexity

Stochastic Positional Encoding (SPE) This is the source code repository for the ICML 2021 paper Relative Positional Encoding for Transformers with Lin

48 Nov 16, 2022

Implementation of Self-supervised Graph-level Representation Learning with Local and Global Structure (ICML 2021).

Self-supervised Graph-level Representation Learning with Local and Global Structure Introduction This project is an implementation of ``Self-supervise

50 Dec 09, 2022

UMich 500-Level Mobile Robotics Course

MOBILE ROBOTICS: METHODS & ALGORITHMS - WINTER 2022 University of Michigan - NA 568/EECS 568/ROB 530 For slides, lecture notes, and example codes, see

393 Dec 29, 2022

Robotics with GPU computing

Robotics with GPU computing Cupoch is a library that implements rapid 3D data processing for robotics using CUDA. The goal of this library is to imple

625 Jan 07, 2023

functorch is a prototype of JAX-like composable function transforms for PyTorch.

1.2k Jan 09, 2023

CoANet: Connectivity Attention Network for Road Extraction From Satellite Imagery

CoANet: Connectivity Attention Network for Road Extraction From Satellite Imagery This paper (CoANet) has been published in IEEE TIP 2021. This code i

53 Dec 03, 2022

Link prediction using Multiple Order Local Information (MOLI)

Understanding the network formation pattern for better link prediction Authors: [e

0 Oct 18, 2021

[CVPR 2020] Interpreting the Latent Space of GANs for Semantic Face Editing

InterFaceGAN - Interpreting the Latent Space of GANs for Semantic Face Editing Figure: High-quality facial attributes editing results with InterFaceGA

1.3k Dec 29, 2022

Repo 4 basic seminar §How to make human machine readable"

WORK IN PROGRESS... Notebooks from the Seminar: Human Machine Readable WS21/22 Introduction into programming Georg Trogemann, Christian Heck, Mattis

3 May 29, 2022

Libraries, tools and tasks created and used at DeepMind Robotics.

270 Nov 30, 2022

Code and Datasets from the paper "Self-supervised contrastive learning for volcanic unrest detection from InSAR data"

Code and Datasets from the paper "Self-supervised contrastive learning for volcanic unrest detection from InSAR data" You can download the pretrained

3 May 07, 2022

Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction

Welcome to Barlow Barlow is a tool for identifying the failure modes for a given neural network. To achieve this, Barlow first creates a group of imag

33 Dec 05, 2022

DeepOBS: A Deep Learning Optimizer Benchmark Suite

DeepOBS - A Deep Learning Optimizer Benchmark Suite DeepOBS is a benchmarking suite that drastically simplifies, automates and improves the evaluation

7 May 12, 2020

PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT.

MoCo v3 for Self-supervised ResNet and ViT Introduction This is a PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT. The original M

887 Jan 08, 2023

The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

FMFCC-A This project is the description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts. The FMFCC-A dataset is shared through BaiduCl

18 Dec 24, 2022