Context Axial Reverse Attention Network for Small Medical Objects Segmentation

Last update: Dec 23, 2022

Overview

CaraNet: Context Axial Reverse Attention Network for Small Medical Objects Segmentation

This repository contains the implementation of a novel attention based network (CaraNet) to segment the polyp (CVC-T, CVC-ClinicDB, CVC-ColonDB, ETIS and Kvasir) and brain tumor (BraTS). The CaraNet show great overall segmentation performance (mean dice) on polyp and brain tumor, but also show great performance on small medical objects (small polyps and brain tumors) segmentation.

The technique report is here: CaraNet

Architecture of CaraNet

Backbone

We use Res2Net as our backbone.

Context module

We choose our CFP module as context module, and choose the dilation rate is 8. For the details of CFP module you can find here: CFPNet. The architecture of CFP module as shown in following figure:

Axial Reverse Attention

As shown in architecture of CaraNet, the Axial Reverse Attention (A-RA) module contains two routes: 1) Reverse attention; 2) Axial-attention.

Installation & Usage

Enviroment

Enviroment: Python 3.6;
Install some packages:

conda install pytorch==1.1.0 torchvision==0.3.0 cudatoolkit=10.0 -c pytorch

conda install opencv-python pillow numpy matplotlib

Clone this repository

git clone https://github.com/AngeLouCN/CaraNet

Training

Download the training and texting dataset from this link: Experiment Dataset
Change the --train_path & --test_path in Train.py
Run Train.py
Testing dataset is ordered as follow:

|-- TestDataset
|   |-- CVC-300
|   |   |-- images
|   |   |-- masks
|   |-- CVC-ClinicDB
|   |   |-- images
|   |   |-- masks
|   |-- CVC-ColonDB
|   |   |-- images
|   |   |-- masks
|   |-- ETIS-LaribPolypDB
|   |   |-- images
|   |   |-- masks
|   |-- Kvasir
|       |-- images
|       |-- masks

Testing

Change the data_path in Test.py

Evaluation

Change the image_root and gt_root in eval_Kvasir.py
You can also run the matlab code in eval fold, it contains other four measurement metrics results.
You can download the segmentation maps of CaraNte from this link: CaraNet

Segmentation Results

Polyp Segmentation Results

Small polyp analysis

The x-axis is the proportion size (%) of polyp; y-axis is the average mean dice coefficient.

Kvasir	CVC-ClinicDB	CVC-ColonDB	ETIS	CVC-300

Brain Tumor Segmentation Results

Small tumor analysis

Citation

@article{lou2021cfpnet,
  title={CFPNet: Channel-wise Feature Pyramid for Real-Time Semantic Segmentation},
  author={Lou, Ange and Loew, Murray},
  journal={arXiv preprint arXiv:2103.12212},
  year={2021}
}

Context Axial Reverse Attention Network for Small Medical Objects Segmentation

Related tags

Overview

CaraNet: Context Axial Reverse Attention Network for Small Medical Objects Segmentation

Architecture of CaraNet

Backbone

Context module

Axial Reverse Attention

Installation & Usage

Enviroment

Training

Testing

Evaluation

Segmentation Results

Citation

Owner

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers

A multi-entity Transformer for multi-agent spatiotemporal modeling.

MISSFormer: An Effective Medical Image Segmentation Transformer

A dual benchmarking study of visual forgery and visual forensics techniques

Perception-aware multi-sensor fusion for 3D LiDAR semantic segmentation (ICCV 2021)

[CVPR 2021] Released code for Counterfactual Zero-Shot and Open-Set Visual Recognition

Evaluation toolkit of the informative tracking benchmark comprising 9 scenarios, 180 diverse videos, and new challenges.

Official codebase used to develop Vision Transformer, MLP-Mixer, LiT and more.

[NeurIPS 2020] Semi-Supervision (Unlabeled Data) & Self-Supervision Improve Class-Imbalanced / Long-Tailed Learning

make ASCII Art by Deep Learning

This repository contains the code for "SBEVNet: End-to-End Deep Stereo Layout Estimation" paper by Divam Gupta, Wei Pu, Trenton Tabor, Jeff Schneider

Generic ecosystem for feature extraction from aerial and satellite imagery

This repo is the official implementation for Multi-Scale Adaptive Graph Neural Network for Multivariate Time Series Forecasting

Algebraic effect handlers in Python

Code for "My(o) Armband Leaks Passwords: An EMG and IMU Based Keylogging Side-Channel Attack" paper

3DMV jointly combines RGB color and geometric information to perform 3D semantic segmentation of RGB-D scans.

Programming with Neural Surrogates of Programs

A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python.

Multi-Modal Machine Learning toolkit based on PaddlePaddle.

FAMIE is a comprehensive and efficient active learning (AL) toolkit for multilingual information extraction (IE)