Detectorch - detectron for PyTorch

Last update: Dec 23, 2022

Overview

(Disclaimer: this is a work in progress and does not provide all of Detectron's functionality. Currently only inference and evaluation are fully supported; training code is experimental.) (News: FPN and ResNet-101 are now supported!)

This code lets you use some of the Detectron object detection models from Facebook AI Research with PyTorch.

It currently supports:

Fast R-CNN
Faster R-CNN
Mask R-CNN

It supports ResNet-50/101 models with or without FPN. The pre-trained Caffe2 models can be imported and used in PyTorch.
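As a rough illustration of what importing a pre-trained model involves, the sketch below opens a Detectron weights file and lists the blobs it contains. It assumes the usual Detectron checkpoint layout (a Python 2 pickle whose weights sit under a "blobs" key); the function names are hypothetical and this is not necessarily how detectorch performs the conversion.

```python
import pickle


def load_detectron_blobs(path):
    """Load a Detectron .pkl weights file and return its blob dictionary."""
    with open(path, "rb") as f:
        # Assumption: the checkpoint was pickled under Python 2, so Python 3
        # needs encoding="latin1" to decode its byte strings.
        data = pickle.load(f, encoding="latin1")
    # Assumption: weights are nested under a "blobs" key; fall back to the
    # top-level dictionary if that key is absent.
    return data["blobs"] if "blobs" in data else data


def summarize_blobs(blobs):
    """Return (name, shape) pairs sorted by name; shape is None for non-arrays."""
    pairs = [(name, getattr(value, "shape", None)) for name, value in blobs.items()]
    return sorted(pairs, key=lambda pair: pair[0])
```

Calling summarize_blobs(load_detectron_blobs(path)) on a downloaded checkpoint (the path is a placeholder for wherever the file was saved) gives a quick view of the layer names and tensor shapes that have to be mapped onto the PyTorch modules.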

Example Mask R-CNN with ResNet-101 and FPN.

Evaluation

Both bounding box evaluation and instance segmentation evaluation were tested, yielding the same results as the Detectron Caffe2 models. The results below were computed using the PyTorch code:

Model	box AP	mask AP	model id
fast_rcnn_R-50-C4_2x	35.6		36224046
fast_rcnn_R-50-FPN_2x	36.8		36225249
e2e_faster_rcnn_R-50-C4_2x	36.5		35857281
e2e_faster_rcnn_R-50-FPN_2x	37.9		35857389
e2e_mask_rcnn_R-50-C4_2x	37.8	32.8	35858828
e2e_mask_rcnn_R-50-FPN_2x	38.6	34.5	35859007
e2e_mask_rcnn_R-101-FPN_2x	40.9	36.4	35861858
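
For readers unfamiliar with the metric, the sketch below shows the intersection-over-union (IoU) computation that underlies box AP. It assumes the table reports standard COCO-style AP, which averages precision over IoU thresholds from 0.5 to 0.95, and it assumes boxes in (x1, y1, x2, y2) corner format. The names are illustrative; the actual evaluation is performed by the Coco API, not by this code.

```python
def box_iou(box_a, box_b):
    """IoU of two boxes given as (x1, y1, x2, y2) corner coordinates."""
    # Corners of the overlapping rectangle (empty if the boxes are disjoint).
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# The ten IoU thresholds used by COCO-style AP: 0.50, 0.55, ..., 0.95.
COCO_IOU_THRESHOLDS = [round(0.5 + 0.05 * i, 2) for i in range(10)]
```

A detection counts as a true positive at a given threshold only if its IoU with a ground-truth box reaches that threshold, so a higher AP reflects tighter localization as well as correct classification.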

Training

Training code is experimental. See train_fast.py for training Fast R-CNN. It seems to work, but it is slow.

Installation

First, clone the repo with git clone --recursive https://github.com/ignacio-rocco/detectorch so that you also clone the COCO API.

The code can be used with PyTorch 0.3.1 or PyTorch 0.4 (master) under Python 3. Anaconda is recommended. Other required packages:

torchvision (conda install torchvision -c soumith)
opencv (conda install -c conda-forge opencv)
cython (conda install cython)
matplotlib (conda install matplotlib)
scikit-image (conda install scikit-image)
ninja (conda install ninja) (required for PyTorch 0.4 only)

Additionally, you need to build the COCO API and the RoIAlign layer. See below.
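
Before building anything, it can help to confirm that the packages listed above are visible to the active Python environment. The following is a minimal sketch using only the standard library. The import names (cv2, Cython, skimage) are the usual module names for those packages, and the function name is hypothetical. It does not check versions.

```python
import importlib.util
import shutil

# Package name -> Python module name. Several packages are imported under a
# different name than the one used to install them.
REQUIRED_MODULES = {
    "torch": "torch",
    "torchvision": "torchvision",
    "opencv": "cv2",
    "cython": "Cython",
    "matplotlib": "matplotlib",
    "scikit-image": "skimage",
}


def check_requirements():
    """Return {package: bool} saying whether each requirement was found."""
    status = {
        package: importlib.util.find_spec(module) is not None
        for package, module in REQUIRED_MODULES.items()
    }
    # ninja is a command-line build tool (needed for PyTorch 0.4 only),
    # so look for the executable on PATH instead of a Python module.
    status["ninja"] = shutil.which("ninja") is not None
    return status


if __name__ == "__main__":
    for package, found in check_requirements().items():
        print(f"{package:14s} {'ok' if found else 'MISSING'}")
```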

Compiling the COCO API

If you cloned this repo with git clone --recursive, you should also have the COCO API in lib/cocoapi. Compile it with:

cd lib/cocoapi/PythonAPI
make install

Compiling RoIAlign

The RoIAlign layer was converted from the Caffe2 version. There are two implementations, one for each supported PyTorch version:

PyTorch 0.4: RoIAlign using the ATen library (lib/cppcuda). Compiled just-in-time (JIT) when loaded.
PyTorch 0.3.1: RoIAlign using TH/THC and cffi (lib/cppcuda_cffi). Needs to be compiled with:

cd lib/cppcuda_cffi
./make.sh
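
The version-to-implementation mapping above can be sketched as a small helper that tells you which directory applies to the installed PyTorch. The function name is hypothetical and the repository may select the implementation differently; only the two versions named above are handled.

```python
def roialign_backend(torch_version):
    """Map a PyTorch version string to the matching RoIAlign source directory."""
    # Keep only major.minor, so "0.3.1" -> (0, 3) and "0.4.0a0+abc" -> (0, 4).
    major, minor = (int(part) for part in torch_version.split(".")[:2])
    if (major, minor) == (0, 4):
        return "lib/cppcuda"       # ATen version, compiled JIT when loaded
    if (major, minor) == (0, 3):
        return "lib/cppcuda_cffi"  # TH/THC + cffi version, built with ./make.sh
    raise ValueError(f"unsupported PyTorch version: {torch_version}")
```

With PyTorch installed, roialign_backend(torch.__version__) returns the directory to use; only the lib/cppcuda_cffi result requires the manual build step.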

Quick Start

Check the demo notebook.

Owner

Ignacio Rocco
