git《USD-Seg:Learning Universal Shape Dictionary for Realtime Instance Segmentation》(2020) GitHub: [fig2]

Last update: Nov 28, 2022

Related tags

Overview

USD-Seg

This project is an implement of paper USD-Seg:Learning Universal Shape Dictionary for Realtime Instance Segmentation, based on FCOS detector from MMDetection tool box.

Introduction

We present a novel explicit shape representation for instance segmentation. The proposed USD-Seg adopts a linear model, sparse coding with dictionary, for object shapes. First, it learns a dictionary from a large collection of shape datasets, making any shape being able to be decomposed into a linear combination through the dictionary. Hence the name "Universal Shape Dictionary". It adds a simple shape vector regression head to ordinary object detector, giving the detector segmentation ability with minimal overhead.

License

This project is released under the Apache 2.0 license.

Model

The overall pipeline of USD-Seg: an RGB image is input to the base detector, and the base detector will regress both detection related information (bounding box and class) and the shape vector. Then the mask will be decoded by simple multiplication between shape vector and dictionary atoms, followed by proper resize and threshold operations.

Installation

Please refer to INSTALL.md for installation and dataset preparation.

Get Started

Please see GETTING_STARTED.md for the basic usage of MMDetection.
We follow the original usage of mmdetection framework. You can use configs for usd-seg in /configs/usdseg/ to train from scratch.

Citation

If you use this toolbox or benchmark in your research, please cite this project and mmdetection.

@article{USD-Seg,
  title   = {Learning Universal Shape Dictionary for Realtime Instance Segmentation},
  author  = {Tang, Tutian and Xu, Wenqiang and Ye, Ruolin and Yang, Lixin and Lu, Cewu},
  journal= {arXiv preprint arXiv:2012.01050},
  year={2020}
}

Contact

This repo is currently maintained by Tutian tang (@ElectronicElephant)and Ruolin Ye (@YoruCathy). Other core developers include Wenqiang Xu (@WenqiangX). For technical details, please feel free to contact the authors directly via Email.

git《USD-Seg:Learning Universal Shape Dictionary for Realtime Instance Segmentation》(2020) GitHub: [fig2]

Related tags

Overview

USD-Seg

Introduction

License

Model

Installation

Get Started

Citation

Contact

Owner

Ruolin Ye

Food Drinks and groceries Images Multi Lingual (FooDI-ML) dataset.

Physics-Aware Training (PAT) is a method to train real physical systems with backpropagation.

Code for "Multi-Time Attention Networks for Irregularly Sampled Time Series", ICLR 2021.

Official implementation for the paper: Generating Smooth Pose Sequences for Diverse Human Motion Prediction

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Can we visualize a large scientific data set with a surrogate model? We're building a GAN for the Earth's Mantle Convection data set to see if we can!

Code for the Paper "Diffusion Models for Handwriting Generation"

codes for IKM (arXiv2021, Submitted to IEEE Trans)

Kindle is an easy model build package for PyTorch.

Benchmark tools for Compressive LiDAR-to-map registration

Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.

Official pytorch implementation of Active Learning for deep object detection via probabilistic modeling (ICCV 2021)

Multilingual Image Captioning

Pytorch implementation of Deep Recursive Residual Network for Super Resolution (DRRN)

PINN(s): Physics-Informed Neural Network(s) for von Karman vortex street

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Pytorch modules for paralel models with same architecture. Ideal for multi agent-based systems

Combining Diverse Feature Priors

Official PyTorch implementation of RIO

Code for the paper Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration