Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Last update: May 28, 2022

Related tags

Deep Learning NRD_decoder

Overview

This repository contains the code for the paper "Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation".

Requirements

This repository requires mmsegmentation.

Training

To train the model(s) in the paper, run this command:

python tools/train.py ./configs/NRD/ade20k/NRD_r101_512x512_164k_ade20k.py

This work uses a batch size of 16. Adjust 'samples_per_gpu' in configs/_base_/datasets/.. so that your setup matches it.
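As a rough illustration of the batch-size note above, the sketch below derives a per-GPU value from a target batch size. It assumes that 16 is the total batch size across all GPUs and that, as in mmsegmentation, the total is 'samples_per_gpu' multiplied by the number of GPUs. The helper name samples_per_gpu_for is hypothetical and not part of this repository.

```python
def samples_per_gpu_for(total_batch_size, num_gpus):
    """Return the per-GPU batch size that yields the given total batch size.

    Hypothetical helper. Assumes total = samples_per_gpu * num_gpus.
    """
    if num_gpus <= 0:
        raise ValueError("num_gpus must be positive")
    if total_batch_size % num_gpus != 0:
        raise ValueError(
            f"{total_batch_size} is not evenly divisible by {num_gpus} GPUs"
        )
    return total_batch_size // num_gpus


if __name__ == "__main__":
    # Print the value to set in the dataset config for common GPU counts.
    for num_gpus in (1, 2, 4, 8):
        value = samples_per_gpu_for(16, num_gpus)
        print(f"{num_gpus} GPU(s): samples_per_gpu={value}")
```

If the GPU count does not divide 16 evenly, the helper raises an error instead of silently rounding, since a different total batch size may change the results.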

Evaluation

To evaluate the model with single-scale inference, run:

python tools/eval.py ./configs/NRD/ade20k/NRD_r101_512x512_164k_ade20k.py {path-to-checkpoint-file} --eval mIoU
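For readers unfamiliar with the mIoU metric requested by the command above, here is a minimal pure-Python sketch of how mean intersection-over-union is commonly computed from flattened label maps. It is not the evaluation code of this repository. The function name mean_iou, the ignore label 255, and the choice to average only over classes that occur are assumptions made for illustration.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over classes that occur in the prediction or the ground truth.

    pred and target are flat sequences of integer class labels of equal length.
    Pixels whose ground-truth label equals ignore_index are skipped (assumed).
    """
    intersect = [0] * num_classes    # pixels where pred == target == c
    pred_area = [0] * num_classes    # pixels predicted as c
    target_area = [0] * num_classes  # pixels labelled as c
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue
        pred_area[p] += 1
        target_area[t] += 1
        if p == t:
            intersect[p] += 1

    ious = []
    for c in range(num_classes):
        union = pred_area[c] + target_area[c] - intersect[c]
        if union > 0:  # skip classes absent from both pred and target
            ious.append(intersect[c] / union)
    return sum(ious) / len(ious) if ious else float("nan")


if __name__ == "__main__":
    pred = [0, 0, 1, 1]
    target = [0, 1, 1, 255]  # the last pixel is ignored
    print(mean_iou(pred, target, num_classes=2))
```

The per-class IoU is the intersection divided by the union of the predicted and labelled regions, and the reported score is the average of those per-class values.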

Pre-trained Models

Results

Our model achieves the following performance on the ADE20K, PASCAL-Context, and Cityscapes validation sets:

[Semantic segmentation results]

Model name	Dataset	mIoU (single-scale)	mIoU (multi-scale)
NRD-r101	ade20k (val)	44.01	45.62
NRD-x101	ade20k (val)	44.34	46.35
NRD-r101	pascal-context (val)	52.31 (59 classes)	54.1 (59 classes)
NRD-r101	pascal-context (val)	47.5 (60 classes)	40.9 (60 classes)
NRD-r50	Cityscapes (val)	79.8	80.8
NRD-r101	Cityscapes (val)	80.7	82.0

Contributing

The code is mostly taken from mmsegmentation. mmsegmentation is released under the Apache 2.0 license.
