PyTorch Implementation of Region Similarity Representation Learning (ReSim)

Last update: Jan 03, 2023

Related tags

Overview

ReSim

This repository provides the PyTorch implementation of Region Similarity Representation Learning (ReSim) described in this paper:

@Article{xiao2021region,
  author  = {Tete Xiao and Colorado J Reed and Xiaolong Wang and Kurt Keutzer and Trevor Darrell},
  title   = {Region Similarity Representation Learning},
  journal = {arXiv preprint arXiv:2103.12902},
  year    = {2021},
}

tldr; ReSim maintains spatial relationships in the convolutional feature maps when performing instance contrastive pre-training, which is useful for region-related tasks such as object detection, segmentation, and dense pose estimation.

Installation

Assuming a conda environment:

conda create --name resim python=3.7
conda activate resim

# NOTE: if you are not using CUDA 10.2, you need to change the 10.2 in this command appropriately. 
# Code tested with torch 1.6 and 1.7
# (check CUDA version with e.g. `cat /usr/local/cuda/version.txt`)
conda install pytorch==1.6.0 torchvision==0.7.0 cudatoolkit=10.2 -c pytorch

Pre-training

This codebase is based on the original MoCo codebase -- see this README for more details.

To pre-train for 200 epochs using the ReSim-FPN implementation as described in the paper:

python main_moco.py -a resnet50 --lr 0.03 --batch-size 256 \
       --dist-url tcp://localhost:10005 --multiprocessing-distributed --world-size 1 --rank 0 \
       --mlp --moco-t 0.2 --aug-plus --cos --epochs 200 \
       /location/of/imagenet/data/folder

ResNet-50 Pre-trained Models

Checkpoint	Pre-train Epochs	COCO AP @2x	MoCo Checkpoint	Detectron Backbone
ReSim-FPN	400	41.9	Download	Download
ReSim-FPN	200	41.4	Download	Download
ReSim-C4	200	41.1	Download	Download

Detection

See these instructions for more details, but in brief:

# first install detectron2
# then place COCO-2017 dataset detection/datasets/coco

cd detection
python convert-pretrain-to-detectron2.py ../resim_fpn_checkpoint_latest.pth.tar detectron_resim_fpn_checkpoint_latest.pth.tar
python train_net.py --dist-url 'tcp://127.0.0.1:17654' --config-file configs/coco_R_50_FPN_2x_moco.yaml --num-gpus 8 MODEL.WEIGHTS detectron_resim_fpn_checkpoint_latest.pth.tar TEST.EVAL_PERIOD 180000 OUTPUT_DIR results/coco2x-resim-fpn SOLVER.CHECKPOINT_PERIOD 180000

License

This project is under the CC-BY-NC 4.0 license. See LICENSE.

PyTorch Implementation of Region Similarity Representation Learning (ReSim)

Related tags

Overview

ReSim

Installation

Pre-training

ResNet-50 Pre-trained Models

Detection

License

Owner

Tete Xiao

Local trajectory planner based on a multilayer graph framework for autonomous race vehicles.

MPLP: Metapath-Based Label Propagation for Heterogenous Graphs

Bayesian Generative Adversarial Networks in Tensorflow

This is an example of object detection on Micro bacterium tuberculosis using Mask-RCNN

Planner_backend - Academic planner application designed for students and counselors.

The code for our NeurIPS 2021 paper "Kernelized Heterogeneous Risk Minimization".

Pre-trained NFNets with 99% of the accuracy of the official paper

MiniSom is a minimalistic implementation of the Self Organizing Maps

The code for Bi-Mix: Bidirectional Mixing for Domain Adaptive Nighttime Semantic Segmentation

Official implementation of "UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-wise Perspective with Transformer"

DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models

CAPRI: Context-Aware Interpretable Point-of-Interest Recommendation Framework

Code repository for our paper regarding the L3D dataset.

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

🏃‍♀️ A curated list about human motion capture, analysis and synthesis.

WORD: Revisiting Organs Segmentation in the Whole Abdominal Region

[ICCV2021] Official Pytorch implementation for SDGZSL (Semantics Disentangling for Generalized Zero-Shot Learning)

Bringing Computer Vision and Flutter together , to build an awesome app !!

🔀 Visual Room Rearrangement

This is the code repository for the paper "Identification of the Generalized Condorcet Winner in Multi-dueling Bandits" (NeurIPS 2021).