FreeSOLO for unsupervised instance segmentation, CVPR 2022

Last update: Jan 02, 2023

Overview

FreeSOLO: Learning to Segment Objects without Annotations

This project hosts the code for implementing the FreeSOLO algorithm for unsupervised instance segmentation.

FreeSOLO: Learning to Segment Objects without Annotations,
Xinlong Wang, Zhiding Yu, Shalini De Mello, Jan Kautz, Anima Anandkumar, Chunhua Shen, Jose M. Alvarez
In: Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2022
arXiv preprint (arXiv 2202.12181)

Visual Results

Installation

Prerequisites

Linux or macOS with Python >= 3.6
PyTorch >= 1.5 and torchvision that matches the PyTorch installation.
scikit-image

Install PyTorch in Conda env

# create conda env
conda create -n detectron2 python=3.6
# activate the enviorment
conda activate detectron2
# install PyTorch >=1.5 with GPU
conda install pytorch torchvision -c pytorch

Build Detectron2 from Source

Follow the INSTALL.md to install Detectron2 (commit id 11528ce has been tested).

Datasets

Follow the datasets/README.md to set up the MS COCO dataset.

Pre-trained model

Download the DenseCL pre-trained model from here. Convert it to detectron2's format and put the converted model under "training_dir/pre-trained/DenseCL" directory.

python tools/convert-pretrain-to-detectron2.py {WEIGHT_FILE}.pth {WEIGHT_FILE}.pkl

Usage

Free Mask

Download the prepared free masks in json format from here. Put it under "datasets/coco/annotations" directory. Or, generate it by yourself:

bash inference_freemask.sh

Training

# train with free masks
bash train.sh

# generate pseudo labels
bash gen_pseudo_labels.sh

# self-train
bash train_pl.sh

Testing

Download the trained model from here.

bash test.sh {MODEL_PATH}

Citations

Please consider citing our paper in your publications if the project helps your research. BibTeX reference is as follow.

@article{wang2022freesolo,
  title={{FreeSOLO}: Learning to Segment Objects without Annotations},
  author={Wang, Xinlong and Yu, Zhiding and De Mello, Shalini and Kautz, Jan and Anandkumar, Anima and Shen, Chunhua and Alvarez, Jose M},
  journal={arXiv preprint arXiv:2202.12181},
  year={2022}
}

FreeSOLO for unsupervised instance segmentation, CVPR 2022

Related tags

Overview

FreeSOLO: Learning to Segment Objects without Annotations

Visual Results

Installation

Prerequisites

Install PyTorch in Conda env

Build Detectron2 from Source

Datasets

Pre-trained model

Usage

Free Mask

Training

Testing

Citations

Owner

NVIDIA Research Projects

Code for our paper A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization,

Towards Rolling Shutter Correction and Deblurring in Dynamic Scenes (CVPR2021)

[SIGGRAPH Asia 2021] DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning.

Anti-UAV base on PaddleDetection

Implementation of Sequence Generative Adversarial Nets with Policy Gradient

Campsite Reservation Finder

This repo generates the training data and the model for Morpheus-Deblend

Answering Open-Domain Questions of Varying Reasoning Steps from Text

Accelerated Multi-Modal MR Imaging with Transformers

A lane detection integrated Real-time Instance Segmentation based on YOLACT (You Only Look At CoefficienTs)

AdaDM: Enabling Normalization for Image Super-Resolution

Swin-Transformer is basically a hierarchical Transformer whose representation is computed with shifted windows.

Generate saved_model, tfjs, tf-trt, EdgeTPU, CoreML, quantized tflite and .pb from .tflite.

This repository is maintained for the scientific paper tittled " Study of keyword extraction techniques for Electric Double Layer Capacitor domain using text similarity indexes: An experimental analysis "

Learning to trade under the reinforcement learning framework

DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

Compact Bidirectional Transformer for Image Captioning

Kaggle Feedback Prize - Evaluating Student Writing 15th solution

A TensorFlow implementation of Neural Program Synthesis from Diverse Demonstration Videos

Code for the paper: Adversarial Machine Learning: Bayesian Perspectives