Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Last update: Aug 03, 2022

Overview

SSWS-loss_function_based_on_MS-TCN

Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Abstract

Recently, more and more videos have been uploaded to the network, so that video analysis task has been one of the most important applications in various fields. At present, video analysis methods can be divided into two kinds: weakly supervised video action segmentation and supervised video action segmentation. The former uses a sliding window or Markov model, while the latter uses the TCN model. In this paper, we introduce the Supervised Sliding Window Smooth Loss Function (SSWS) into the TCN baseline, which is a complement to MS-TCN smoothing loss function TMSE. In this method, three discriminant frames are selected from the video prediction sequence and combined into an adaptive sliding window to selectively smooth the whole prediction sequence. In particular, it doubles the penalty when it slides to the wrong place in the category. Compared to TMSE, our method effectively increases the receptive field of smoothing loss function. And, the proposed new supervised loss function only penalizes error frames. The experiment shows that compared with the Smoothing loss function TMSE of MS-TCN, SSWS has significantly improved in the three datasets: 50Salads, GTEA and the Breakfast Dataset.

Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Related tags

Overview

SSWS-loss_function_based_on_MS-TCN

Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Abstract

Owner

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens

The pytorch implementation of DG-Font: Deformable Generative Networks for Unsupervised Font Generation

AWS provides a Python SDK, "Boto3" ,which can be used to access the AWS-account from the local.

[SIGIR22] Official PyTorch implementation for "CORE: Simple and Effective Session-based Recommendation within Consistent Representation Space".

Weakly Supervised Segmentation by Tensorflow.

Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].

Phonetic PosteriorGram (PPG)-Based Voice Conversion (VC)

BiSeNet based on pytorch

Neurons Dataset API - The official dataloader and visualization tools for Neurons Datasets.

Transfer Reinforcement Learning for Differing Action Spaces via Q-Network Representations

Exadel CompreFace is a free and open-source face recognition GitHub project

A Robust Unsupervised Ensemble of Feature-Based Explanations using Restricted Boltzmann Machines

Pytorch implementation for ACMMM2021 paper "I2V-GAN: Unpaired Infrared-to-Visible Video Translation".

Relative Positional Encoding for Transformers with Linear Complexity

Source code for Acorn, the precision farming rover by Twisted Fields

ML model to classify between cats and dogs

Code for Discriminative Sounding Objects Localization (NeurIPS 2020)

This is a simple framework to make object detection dataset very quickly

This is a collection of our NAS and Vision Transformer work.

HackBMU-5.0-Team-Ctrl-Alt-Elite - HackBMU 5.0 Team Ctrl Alt Elite