A baseline code for VSPW

Last update: Aug 22, 2022

Preparation

Download VSPW dataset

The VSPW dataset, with extracted frames and masks, is available here. Please download the 480p version of the dataset.

Dependencies

Python 3.7
PyTorch 1.7
NumPy
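
Before training, it can help to confirm that the interpreter and packages listed above are visible. The following is a minimal sketch, not part of the repository: the function name `check_environment` is hypothetical, and it only checks that the modules can be located, not that PyTorch is exactly version 1.7.

```python
import importlib.util
import sys


def check_environment(modules=("torch", "numpy"), min_python=(3, 7)):
    """Report whether Python is recent enough and which modules can be found.

    Hypothetical helper for illustration; it does not verify package versions.
    """
    report = {"python_ok": sys.version_info[:2] >= min_python}
    for name in modules:
        # find_spec returns None when a top-level module cannot be located.
        report[name] = importlib.util.find_spec(name) is not None
    return report


if __name__ == "__main__":
    for key, ok in check_environment().items():
        print(f"{key}: {'ok' if ok else 'MISSING'}")
```

Run it with the same interpreter you plan to use for the training scripts, so the report reflects the right environment.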

Download the ImageNet-pretrained models at this link. Put the downloaded archive in the repository root folder and decompress it there.

Train and Test

Edit the .sh files in scripts/ and set $DATAROOT to the path of your VSPW_480p folder.
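
A wrong $DATAROOT is easy to catch before launching a long run. The sketch below counts frames and masks per clip under a given root. It is an illustration only: the function name `summarize_dataroot` is hypothetical, and the layout `<dataroot>/data/<video>/origin/*.jpg` with `<dataroot>/data/<video>/mask/*.png` is an assumption that you should adjust to match your download.

```python
import sys
from pathlib import Path


def summarize_dataroot(dataroot, frames_dir="origin", masks_dir="mask"):
    """Return {video_name: (n_frames, n_masks)} for each clip under <dataroot>/data.

    ASSUMPTION: frames are .jpg files in <video>/origin and masks are .png
    files in <video>/mask; change the arguments if your copy differs.
    """
    data_dir = Path(dataroot) / "data"
    if not data_dir.is_dir():
        raise FileNotFoundError(f"expected a 'data' folder under {dataroot}")
    summary = {}
    for video in sorted(p for p in data_dir.iterdir() if p.is_dir()):
        n_frames = len(list((video / frames_dir).glob("*.jpg")))
        n_masks = len(list((video / masks_dir).glob("*.png")))
        summary[video.name] = (n_frames, n_masks)
    return summary


if __name__ == "__main__":
    # Usage: python check_dataroot.py /path/to/VSPW_480p
    stats = summarize_dataroot(sys.argv[1])
    empty = [name for name, (frames, _) in stats.items() if frames == 0]
    print(f"{len(stats)} clips found, {len(empty)} without frames")
```

If the script reports zero clips or many clips without frames, the path in the .sh files most likely points at the wrong directory level.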

Image-based methods

PSPNet

sh scripts/run_psp.sh

OCRNet

sh scripts/run_ocr.sh

Evaluation on TC and VC

Set the dataset root and the prediction root in TC_cal.py and VC_perclip.py to your own paths, then run the two scripts:

python TC_cal.py

python VC_perclip.py

This implementation builds on this code and RAFT.

Citation

@inproceedings{miao2021vspw,
  title={VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild},
  author={Miao, Jiaxu and Wei, Yunchao and Wu, Yu and Liang, Chen and Li, Guangrui and Yang, Yi},
  booktitle={Proceedings of the {IEEE} Conference on Computer Vision and Pattern Recognition},
  year={2021}
}
