Confident Semantic Ranking Loss for Part Parsing

Last update: Oct 22, 2022

Related tags

Deep Learning CSR

Overview

How to run:

Dataset

Download PASCAL-Part dataset [https://cs.stanford.edu/~roozbeh/pascal-parts/pascal-parts.html]
Download the multi-class annotations from [http://cvteam.net/projects/2019/multiclass-part.html]
Modify the configurations in /experiments/CSR/config.py. (The initial performance is about 59.45, then the reported performance can be achieved by fine-tuning.)
Modify the dataset path in /lib/datasets

(There might be different versions of this dataset, we follow the annotations of CVPR17 to make fair comparisons.)

PASCAL-Part-multi-class Dataset: http://cvteam.net/projects/2019/figs/Affined.zip

For Test

Download the pretrained model and modify the path in /experiments/config.py
RUN /experiments/CSR/test.py
(Additionally) If customize data, you need to generate a filelist following the VOC format and modify the dataset path.

For Training

If training from scratch, simply run. If not, customize the dir in /experiments/CSR config.py.

(A training demo code is provided in train.py)

(Additionally) download the ImageNet pretrained model:

model_urls = {

'resnet18': 'https://download.pytorch.org/models/resnet18-5c106cde.pth',

'resnet34': 'https://download.pytorch.org/models/resnet34-333f7ec4.pth',

'resnet50': 'https://download.pytorch.org/models/resnet50-19c8e357.pth',

'resnet101': 'https://download.pytorch.org/models/resnet101-5d3b4d8f.pth',

'resnet152': 'https://download.pytorch.org/models/resnet152-b121ed2d.pth',

}
Prerequisites: generate semantic part boundaries and semantic object labels. (will be provided soon)
RUN /experiments/CSR/train.py for 100 epochs. (Achieve 59.45 mIoU)
Fine-tune the model using learning rate=0.003 for another 40 epochs. (Achieve 60.70 mIoU)

Acknowledgement

The code is based on the below project:

Yifan Zhao, Jia Li, Yu Zhang, and Yonghong Tian. Multi-class Part Parsing with Joint Boundary-Semantic Awareness in ICCV 2019.

Citation

@inproceedings{tan2021confident,
  title={Confident Semantic Ranking Loss for Part Parsing},
  author={Tan, Xin and Xu, Jiachen and Ye, Zhou and Hao, Jinkun and Ma, Lizhuang},
  booktitle={2021 IEEE International Conference on Multimedia and Expo (ICME)},
  pages={1--6},
  year={2021},
  organization={IEEE}
}

Confident Semantic Ranking Loss for Part Parsing

Related tags

Overview

How to run:

Dataset

PASCAL-Part-multi-class Dataset: http://cvteam.net/projects/2019/figs/Affined.zip

For Test

For Training

Acknowledgement

Yifan Zhao, Jia Li, Yu Zhang, and Yonghong Tian. Multi-class Part Parsing with Joint Boundary-Semantic Awareness in ICCV 2019.

Citation

Owner

Jiachen Xu

The missing CMake project initializer

High dimensional black-box optimizer using Latent Action Monte Carlo Tree Search algorithm

DziriBERT: a Pre-trained Language Model for the Algerian Dialect

Display, filter and search log messages in your terminal

Directed Greybox Fuzzing with AFL

This is the official Pytorch-version code of FlatGCN (Flattened Graph Convolutional Networks for Recommendation).

Low Complexity Channel estimation with Neural Network Solutions

PyTorch Implementation for "ForkGAN with SIngle Rainy NIght Images: Leveraging the RumiGAN to See into the Rainy Night"

Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)

Leveraging Instance-, Image- and Dataset-Level Information for Weakly Supervised Instance Segmentation

Face and other object detection using OpenCV and ML Yolo

PyTorch implementation of Rethinking Positional Encoding in Language Pre-training

disentanglement_lib is an open-source library for research on learning disentangled representations.

EmoTag helps you train emotion detection model for Chinese audios

Automatically align face images 🙃→🙂. Can also do windowing and warping.

Domain Generalization with MixStyle, ICLR'21.

PyTorch and GPyTorch implementation of the paper "Conditioning Sparse Variational Gaussian Processes for Online Decision-making."

UnsupervisedR&R: Unsupervised Pointcloud Registration via Differentiable Rendering

TensorFlow implementation of Style Transfer Generative Adversarial Networks: Learning to Play Chess Differently.

A hobby project which includes a hand-gesture based virtual piano using a mobile phone camera and OpenCV library functions