OBG-FCN

This repository is to reproduce the implementation of 'Object Boundary Guided Semantic Segmentation' in http://arxiv.org/abs/1603.09742

Object Boundary Guided Semantic Segmentation
Qin Huang, Chunyang Xia, Wenchao Zheng, Yuhang Song, Hao Xu, C.-C. Jay Kuo
arXiv:1603.09742

the paper claimed to achieve 87.5% mean IU in PASCAL VOC 2011 validation set with only the training images of VOC 2011 training set.

The code is based on the repository of https://github.com/shelhamer/fcn.berkeleyvision.org, which contains the offical code for the paper:

Fully Convolutional Models for Semantic Segmentation
Jonathan Long*, Evan Shelhamer*, Trevor Darrell
CVPR 2015
arXiv:1411.4038

The implementation is just for test and could not achieve result close to Object Boundary Guided Semantic Segmentation so far. Any suggestion is more than welcome

Mdoels are trained using extra data from Hariharan et al., but excluding SBD val. Mdoels are tested using aug_val set by excluding the overlapping images in VOC train_val dataset.

Here is the result so far:

[FCN-32s sbd]: mean IU 0.601230112927 on aug_val
[FCN-16s sbd]: mean IU 0.623964674094 on aug_val
[FCN-8s sbd]: mean IU 0.625525553796 on aug_val
[FCN-16s OBG-8s sbd]: mean IU 0.628746446579 on aug_val
[FCN-8s OBG-8s sbd]: mean IU 0.630523623869 on aug_val
[FCN-8s OBG-4s sbd]: mean IU 0.593030120308 on aug_val
[FCN-8s OBG-2s sbd]: mean IU 0.577085377376 on aug_val

model link:

[FCN-8s OBG-2s sbd]: voc-fcn8s-obg2s.caffemodel: https://drive.google.com/open?id=0B5i4atpKg9EcRU9rb1lwd1VnTlE
[FCN-8s OBG-4s sbd]: voc-fcn8s-obg4s.caffemodel: https://drive.google.com/open?id=0B5i4atpKg9EcU3U5Xy05Tm5kX0U
[FCN-8s OBG-8s sbd]: voc-fcn8s-obg8s.caffemodel: https://drive.google.com/open?id=0B5i4atpKg9EcMWJXcFl5MGdwQ2c

There must be major bugs in the implementation since the performace decreased when combining pool2 and pool1 for object boundary.

OBG-FCN - implementation of 'Object Boundary Guided Semantic Segmentation'

Related tags

Overview

OBG-FCN

Owner

Jiu XU

Orthogonal Over-Parameterized Training

Huawei Hackathon 2021 - Sweden (Stockholm)

Wider or Deeper: Revisiting the ResNet Model for Visual Recognition

Vehicle direction identification consists of three module detection , tracking and direction recognization.

A set of tools to pre-calibrate and calibrate (multi-focus) plenoptic cameras (e.g., a Raytrix R12) based on the libpleno.

Head2Toe: Utilizing Intermediate Representations for Better OOD Generalization

code associated with ACL 2021 DExperts paper

Evolving neural network parameters in JAX.

“袋鼯麻麻——智能购物平台”能够精准地定位识别每一个商品

A faster pytorch implementation of faster r-cnn

End-To-End Optimization of LiDAR Beam Configuration

An Unsupervised Detection Framework for Chinese Jargons in the Darknet

To model the probability of a soccer coach leave his/her team during Campeonato Brasileiro for 10 chosen teams and considering years 2018, 2019 and 2020.

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

ROS Basics and TurtleSim

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Playing around with FastAPI and streamlit to create a YoloV5 object detector

Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Pytorch implementation of the paper Progressive Growing of Points with Tree-structured Generators (BMVC 2021)

Official implementation for paper Render In-between: Motion Guided Video Synthesis for Action Interpolation