A Survey on Deep Learning Technique for Video Segmentation

A Survey on Deep Learning Technique for Video Segmentation
Wenguan Wang, Tianfei Zhou, Fatih Porikli, David Crandall, and Luc Van Gool.

Contributing

Please feel free to create issues or pull requests to add papers.

Welcome any discussions on video segmentation at

1. Introduction

Video segmentation, i.e., partitioning video frames into multiple segments or objects, plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to virtual background creation in video conferencing. In this survey, we comprehensively review two basic lines of research — video object segmentation and video semantic segmentation — by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. In particular, we review eight sub-fields as given in the following figure:

2. Deep Learning-based Video Object Segmentation

3. Deep Learning-based Video Semantic Segmentation

4. Datasets

Popular Datasets in VOS and VSS

Citation

If you find our survey and repository useful for your research, please consider citing our paper:

@article{wang2021survey,
  title={A survey on deep learning technique for video segmentation},
  author={Wang, Wenguan and Zhou, Tianfei and Porikli, Fatih and Crandall, David and Van Gool, Luc},
  journal={arXiv preprint arXiv:2107.01153},
  year={2021}
}

A Survey on Deep Learning Technique for Video Segmentation

Related tags

Overview

A Survey on Deep Learning Technique for Video Segmentation

Contributing

1. Introduction

2. Deep Learning-based Video Object Segmentation

3. Deep Learning-based Video Semantic Segmentation

4. Datasets

Citation

Owner

Tianfei Zhou

CBKH: The Cornell Biomedical Knowledge Hub

Official PyTorch implementation of the paper "Graph-based Generative Face Anonymisation with Pose Preservation" in ICIAP 2021

Official repo for BMVC2021 paper ASFormer: Transformer for Action Segmentation

This repository contains the accompanying code for Deep Virtual Markers for Articulated 3D Shapes, ICCV'21

Official implementation of the paper Do pedestrians pay attention? Eye contact detection for autonomous driving

Towards Long-Form Video Understanding

Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filters, and more. All exercises include solutions.

GEA - Code for Guided Evolution for Neural Architecture Search

Malware Analysis Neural Network project.

A simple interface for editing natural photos with generative neural networks.

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

A general framework for deep learning experiments under PyTorch based on pytorch-lightning

Automatic library of congress classification, using word embeddings from book titles and synopses.

GLNet for Memory-Efficient Segmentation of Ultra-High Resolution Images

An implementation of a discriminant function over a normal distribution to help classify datasets.

Semi-supervised learning for object detection

'Aligned mixture of latent dynamical systems' (amLDS) for stimulus decoding probabilistic manifold alignment across animals. P. Herrero-Vidal et al. NeurIPS 2021 code.

Implements an infinite sum of poisson-weighted convolutions

A transformer which can randomly augment VOC format dataset (both image and bbox) online.

Hierarchical Memory Matching Network for Video Object Segmentation (ICCV 2021)