Official implementation of the ICCV 2021 paper "Joint Inductive and Transductive Learning for Video Object Segmentation"

Last update: Oct 16, 2022

Related tags

Deep Learning JOINT

Overview

JOINT

This is the official implementation of Joint Inductive and Transductive learning for Video Object Segmentation, to appear in ICCV 2021.

@inproceedings{joint_iccv_2021,
  title={Joint Inductive and Transductive Learning for Video Object Segmentation},
  author={Yunyao Mao, Ning Wang, Wengang Zhou, Houqiang Li},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
  month = {October},
  year={2021}
}

Installation

Clone this repository

git clone https://github.com/maoyunyao/JOINT.git

Install dependencies

Please check the detailed installation instructions.

Training

The whole network is trained with 8 NVIDIA GTX 1080Ti GPUs

conda activate pytracking
cd ltr
python run_training.py joint joint_stage1  # stage 1
python run_training.py joint joint_stage2  # stage 2

Note: We initialize the backbone ResNet with pre-trained Mask-RCNN weights as in LWL. These weights can be obtained from here. Before training, you need to download and save these weights in env_settings().pretrained_networks directory.

Evaluation

conda activate pytracking
cd pytracking
python run_tracker.py joint joint_davis --dataset_name dv2017_val        # DAVIS 2017 Val
python run_tracker.py joint joint_ytvos --dataset_name yt2018_valid_all  # YouTube-VOS 2018 Val
python run_tracker.py joint joint_ytvos --dataset_name yt2019_valid_all  # YouTube-VOS 2019 Val

Note: Before evaluation, the pretrained networks (see model zoo) should be downloaded and saved into the directory set by "network_path" in "pytracking/evaluation/local.py". By default, it is set to pytracking/networks.

Model Zoo

Models

Model	YouTube-VOS 2018 (Overall Score)	YouTube-VOS 2019 (Overall Score)	DAVIS 2017 val (J&F score)	Links	Raw Results
JOINT_ytvos	83.1	82.8	--	model	results
JOINT_davis	--	--	83.5	model	results

Acknowledgments

Our JOINT segmentation tracker is implemented based on pytracking. We sincerely thank the authors Martin Danelljan and Goutam Bhat for providing such a great framework.
We adopt the few-shot learner proposed in LWL as the Induction branch.

Official implementation of the ICCV 2021 paper "Joint Inductive and Transductive Learning for Video Object Segmentation"

Related tags

Overview

JOINT

Installation

Clone this repository

Install dependencies

Training

Evaluation

Model Zoo

Models

Acknowledgments

Owner

Yunyao

Deep Anomaly Detection with Outlier Exposure (ICLR 2019)

Hierarchical Metadata-Aware Document Categorization under Weak Supervision (WSDM'21)

Users can free try their models on SIDD dataset based on this code

Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"

SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]

Implementation of CVPR'21: RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction

Implements VQGAN+CLIP for image and video generation, and style transfers, based on text and image prompts. Emphasis on ease-of-use, documentation, and smooth video creation.

The official PyTorch implementation for NCSNv2 (NeurIPS 2020)

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.

Submission to Twitter's algorithmic bias bounty challenge

Taking A Closer Look at Domain Shift: Category-level Adversaries for Semantics Consistent Domain Adaptation

A curated list of Generative Deep Art projects, tools, artworks, and models

Explaining Hyperparameter Optimization via PDPs

OcclusionFusion: realtime dynamic 3D reconstruction based on single-view RGB-D

NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥

These are the materials for the paper "Few-Shot Out-of-Domain Transfer Learning of Natural Language Explanations"

Fast and exact ILP-based solvers for the Minimum Flow Decomposition (MFD) problem, and variants of it.

An Official Repo of CVPR '20 "MSeg: A Composite Dataset for Multi-Domain Segmentation"

Code for TIP 2017 paper --- Illumination Decomposition for Photograph with Multiple Light Sources.

Materials for upcoming beginner-friendly PyTorch course (work in progress).

Official implementation of the ICCV 2021 paper "Joint Inductive and Transductive Learning for Video Object Segmentation"

Related tags

Overview

JOINT

Installation

Clone this repository

Install dependencies

Training

Evaluation

Model Zoo

Models

Acknowledgments

Owner

Yunyao

Deep Anomaly Detection with Outlier Exposure (ICLR 2019)

Hierarchical Metadata-Aware Document Categorization under Weak Supervision (WSDM'21)

Users can free try their models on SIDD dataset based on this code

Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"

SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]

Implementation of CVPR'21: RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction

Implements VQGAN+CLIP for image and video generation, and style transfers, based on text and image prompts. Emphasis on ease-of-use, documentation, and smooth video creation.

The official PyTorch implementation for NCSNv2 (NeurIPS 2020)

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

Submission to Twitter's algorithmic bias bounty challenge

Taking A Closer Look at Domain Shift: Category-level Adversaries for Semantics Consistent Domain Adaptation

A curated list of Generative Deep Art projects, tools, artworks, and models

Explaining Hyperparameter Optimization via PDPs

OcclusionFusion: realtime dynamic 3D reconstruction based on single-view RGB-D

NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥

These are the materials for the paper "Few-Shot Out-of-Domain Transfer Learning of Natural Language Explanations"

Fast and exact ILP-based solvers for the Minimum Flow Decomposition (MFD) problem, and variants of it.

An Official Repo of CVPR '20 "MSeg: A Composite Dataset for Multi-Domain Segmentation"

Code for TIP 2017 paper --- Illumination Decomposition for Photograph with Multiple Light Sources.

Materials for upcoming beginner-friendly PyTorch course (work in progress).

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.