A Strong Baseline for Image Semantic Segmentation

Introduction

This project is an open source semantic segmentation toolbox based on PyTorch. It is based on the codes of our Tianchi competition in 2021 (https://tianchi.aliyun.com/competition/entrance/531860/introduction).
In the competition, our team won the third place (please see Tianchi_README.md).

Overview

The master branch works with PyTorch 1.6+.The project now supports popular and contemporary semantic segmentation frameworks, e.g. UNet, DeepLabV3+, HR-Net etc.

Requirements

Support

Backbone

ResNet (CVPR'2016)
SeNet (CVPR'2018)
IBN-Net (CVPR'2018)
EfficientNet (CVPR'2020)

Methods

Tricks

Tools

large image inference (cut and merge)
post process (crf/superpixels)

Quick Start

Train a model

python train.py --config_file ${CONFIG_FILE}

CONFIG_FILE: File of training config about model

Examples:
We trained our model in Tianchi competition according to the following script:
Stage 1 (160e)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_160e.yml

Stage 2 (swa 24e)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_swa.yml

Inference with pretrained models

python inference.py --config_file ${CONFIG_FILE}

CONFIG_FILE: File of inference config about model

Predict large image with pretrained models

python predict_demo.py --config_file ${CONFIG_FILE} --rs_img_file ${IMAGE_FILE_PATH} --temp_img_save_path ${TEMP_CUT_PATH} -temp_seg_map_save_path ${TEMP_SAVE_PATH} --save_seg_map_file ${SAVE_SEG_FILE}

CONFIG_FILE: File of inference config about model
IMAGE_FILE_PATH: File of large input image to predict
TEMP_CUT_PATH: Temp folder of small cutting samples
TEMP_SAVE_PATH: Temp folder of predict results of cutting samples
SAVE_SEG_FILE: Predict result of the large image

A Strong Baseline for Image Semantic Segmentation

Related tags

Overview

A Strong Baseline for Image Semantic Segmentation

Introduction

Overview

Requirements

Support

Backbone

Methods

Tricks

Tools

Quick Start

Train a model

Inference with pretrained models

Predict large image with pretrained models

Owner

Clark He

Official repo for the work titled "SharinGAN: Combining Synthetic and Real Data for Unsupervised GeometryEstimation"

A Pytorch Implementation of Source Data-free Domain Adaptation for a Faster R-CNN

Use of Attention Gates in a Convolutional Neural Network / Medical Image Classification and Segmentation

Probabilistic Entity Representation Model for Reasoning over Knowledge Graphs

Speed-Test - You can check your intenet speed using this tool

Navigating StyleGAN2 w latent space using CLIP

Implementation of our recent paper, WOOD: Wasserstein-based Out-of-Distribution Detection.

Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

API for RL algorithm design & testing of BCA (Building Control Agent) HVAC on EnergyPlus building energy simulator by wrapping their EMS Python API

Causal Imitative Model for Autonomous Driving

Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

Demo for Real-time RGBD-based Extended Body Pose Estimation paper

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Official code for UnICORNN (ICML 2021)

[CVPR 2022 Oral] Versatile Multi-Modal Pre-Training for Human-Centric Perception

A minimal implementation of face-detection models using flask, gunicorn, nginx, docker, and docker-compose

Official PyTorch implementation of "Evolving Search Space for Neural Architecture Search"

A hybrid framework (neural mass model + ML) for SC-to-FC prediction

Imaginaire - NVIDIA's Deep Imagination Team's PyTorch Library

Manim is an engine for precise programmatic animations, designed for creating explanatory math videos