A PyTorch implementation of the CVPR2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"

Last update: Nov 29, 2022

Overview

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild

Preparation

Download VSPW dataset

The VSPW dataset, with extracted frames and masks, is available here. The VSPW_480P version of the dataset can now be downloaded directly.

Dependencies

Python 3.7
PyTorch 1.3.1
NumPy

Download the ImageNet-pretrained models at this link. Put the downloaded archive in the repository root folder and decompress it there.

Train and Test

Resize the frames and masks of the VSPW dataset to 480p.

python change2_480p.py
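The resizing step matters most for the masks: label ids have to survive the resize unchanged, so masks are usually resampled with nearest-neighbour interpolation rather than a smoothing filter. The sketch below is not the content of change2_480p.py. It assumes that "480p" means a target height of 480 pixels with the aspect ratio preserved, and the helper names target_size and resize_nearest are hypothetical.

```python
def target_size(width, height, target_height=480):
    """Return (new_width, new_height) with the aspect ratio preserved.

    Assumption: "480p" means the frame height becomes 480 pixels.
    """
    new_width = round(width * target_height / height)
    return new_width, target_height


def resize_nearest(mask, new_width, new_height):
    """Nearest-neighbour resize of a 2D label mask given as a list of rows.

    Every output pixel copies exactly one input pixel, so the result
    never contains a label id that was absent from the input.
    """
    old_height = len(mask)
    old_width = len(mask[0])
    rows = [min(old_height - 1, (y * old_height) // new_height)
            for y in range(new_height)]
    cols = [min(old_width - 1, (x * old_width) // new_width)
            for x in range(new_width)]
    return [[mask[r][c] for c in cols] for r in rows]


if __name__ == "__main__":
    print(target_size(1920, 1080))  # (853, 480)

    tiny_mask = [[0, 0, 7, 7],
                 [0, 0, 7, 7],
                 [3, 3, 7, 7],
                 [3, 3, 7, 7]]
    print(resize_nearest(tiny_mask, 2, 2))  # [[0, 7], [3, 7]]
```

With a real image library the same idea applies: use a smooth filter for the RGB frames and a nearest-neighbour filter for the masks.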

Edit the .sh files in scripts/ and set $DATAROOT to the path of your VSPW_480p directory.

Image-based methods

PSPNet

sh scripts/run_psp.sh

OCRNet

sh scripts/run_ocr.sh

Video-based methods

TCB-PSP

sh run_temporal_psp.sh

TCB-OCR

sh run_temporal_ocr.sh

Evaluation on TC and VC

Set the data root and the prediction root in TC_cal.py and VC_perclip.py, then run both scripts:

python TC_cal.py

python VC_perclip.py
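To make concrete what a clip-level consistency score measures, here is a minimal sketch. It is not the code in VC_perclip.py. It assumes that VC stands for video consistency and that the score is defined as follows: among the pixels whose ground-truth label stays the same over a clip of consecutive frames, take the fraction whose predicted label also stays the same, then average over all sliding clips. Frames are flat lists of per-pixel label ids, and the function names are hypothetical.

```python
def clip_consistency(gt_frames, pred_frames):
    """Score one clip. Each frame is a flat list of per-pixel label ids."""
    num_pixels = len(gt_frames[0])
    gt_stable = 0    # pixels whose ground-truth label never changes
    both_stable = 0  # ... and whose predicted label never changes either
    for p in range(num_pixels):
        if all(frame[p] == gt_frames[0][p] for frame in gt_frames):
            gt_stable += 1
            if all(frame[p] == pred_frames[0][p] for frame in pred_frames):
                both_stable += 1
    # A clip without any stable ground-truth pixel has no defined score.
    return both_stable / gt_stable if gt_stable else None


def video_consistency(gt_video, pred_video, clip_len):
    """Average the clip score over all sliding windows of clip_len frames."""
    scores = []
    for start in range(len(gt_video) - clip_len + 1):
        score = clip_consistency(gt_video[start:start + clip_len],
                                 pred_video[start:start + clip_len])
        if score is not None:
            scores.append(score)
    return sum(scores) / len(scores) if scores else None


if __name__ == "__main__":
    # Three frames of four pixels each (toy data, not from VSPW).
    gt = [[1, 1, 2, 2], [1, 1, 2, 3], [1, 1, 2, 3]]
    pred = [[1, 1, 2, 2], [1, 5, 2, 2], [1, 5, 2, 2]]
    print(video_consistency(gt, pred, clip_len=2))
```

In the toy data the first window scores 2/3 (one of the three stable ground-truth pixels flickers in the prediction) and the second scores 1, so the average is 5/6. A temporal-consistency score that compares flow-warped predictions additionally needs an optical-flow model and is not sketched here.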

This implementation builds on this code and on RAFT.

Citation

@inproceedings{miao2021vspw,
  title={VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild},
  author={Miao, Jiaxu and Wei, Yunchao and Wu, Yu and Liang, Chen and Li, Guangrui and Yang, Yi},
  booktitle={Proceedings of the {IEEE} Conference on Computer Vision and Pattern Recognition},
  year={2021}
}
