An official implementation of the Anchor DETR.

Last update: Dec 28, 2022

Related tags

Overview

Anchor DETR: Query Design for Transformer-Based Detector

Introduction

This repository is an official implementation of the Anchor DETR. We encode the anchor points as the object queries in DETR. Multiple patterns are attached to each anchor point to solve the difficulty: "one region, multiple objects". We also propose an attention variant RCDA to reduce the memory cost for high-resolution features.

Main Results

	feature	epochs	AP	GFLOPs	Infer Speed (FPS)
DETR	DC5	500	43.3	187	10 (12)
SMCA	multi-level	50	43.7	152	10
Deformable DETR	multi-level	50	43.8	173	15
Conditional DETR	DC5	50	43.8	195	10
Anchor DETR	DC5	50	44.3	151	16 (19)

Note:

The results are based on ResNet-50 backbone.
Inference speeds are measured on NVIDIA Tesla V100 GPU.
The values in parentheses of the Infer Speed indicate the speed with torchscript optimization.

Model

name	backbone	AP	URL
AnchorDETR-C5	R50	42.1	model / log
AnchorDETR-DC5	R50	44.3	model / log
AnchorDETR-C5	R101	43.5	model / log
AnchorDETR-DC5	R101	45.1	model / log

Note: the models and logs are also available at Baidu Netdisk with code hh13.

Usage

Installation

First, clone the repository locally:

git clone https://github.com/megvii-research/AnchorDETR.git

Then, install dependencies:

pip install -r requirements.txt

Training

To train AnchorDETR on a single node with 8 GPUs:

python -m torch.distributed.launch --nproc_per_node=8 --use_env main.py  --coco_path /path/to/coco

Evaluation

To evaluate AnchorDETR on a single node with 8 GPUs:

python -m torch.distributed.launch --nproc_per_node=8 --use_env main.py --eval --coco_path /path/to/coco --resume /path/to/checkpoint.pth

To evaluate AnchorDETR with a single GPU:

python main.py --eval --coco_path /path/to/coco --resume /path/to/checkpoint.pth

Citation

If you find this project useful for your research, please consider citing the paper.

@misc{wang2021anchor,
      title={Anchor DETR: Query Design for Transformer-Based Detector},
      author={Yingming Wang and Xiangyu Zhang and Tong Yang and Jian Sun},
      year={2021},
      eprint={2109.07107},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Contact

If you have any questions, feel free to open an issue or contact us at [email protected].

An official implementation of the Anchor DETR.

Related tags

Overview

Anchor DETR: Query Design for Transformer-Based Detector

Introduction

Main Results

Model

Usage

Installation

Training

Evaluation

Citation

Contact

Owner

MEGVII Research

Tensorflow implementation of soft-attention mechanism for video caption generation.

Supercharging Imbalanced Data Learning WithCausal Representation Transfer

Official implementation of "Motif-based Graph Self-Supervised Learning forMolecular Property Prediction"

This repository contains the code for: RerrFact model for SciVer shared task

Ipython notebook presentations for getting starting with basic programming, statistics and machine learning techniques

The code of paper 'Learning to Aggregate and Personalize 3D Face from In-the-Wild Photo Collection'

Here we present the implementation in TensorFlow of our work about liver lesion segmentation accepted in the Machine Learning 4 Health Workshop

G-NIA model from "Single Node Injection Attack against Graph Neural Networks" (CIKM 2021)

graph-theoretic framework for robust pairwise data association

Learn other languages using artificial intelligence with python.

Generalized Decision Transformer for Offline Hindsight Information Matching

Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021

Library for 8-bit optimizers and quantization routines.

Autoregressive Models in PyTorch.

The official repository for "Revealing unforeseen diagnostic image features with deep learning by detecting cardiovascular diseases from apical four-chamber ultrasounds"

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

A python bot to move your mouse every few seconds to appear active on Skype, Teams or Zoom as you go AFK. 🐭 🤖

Classification models 1D Zoo - Keras and TF.Keras

The official PyTorch code for 'DER: Dynamically Expandable Representation for Class Incremental Learning' accepted by CVPR2021

MMDetection3D is an open source object detection toolbox based on PyTorch

An official implementation of the Anchor DETR.

Related tags

Overview

Anchor DETR: Query Design for Transformer-Based Detector

Introduction

Main Results

Model

Usage

Installation

Training

Evaluation

Citation

Contact

Owner

MEGVII Research

Tensorflow implementation of soft-attention mechanism for video caption generation.

Supercharging Imbalanced Data Learning WithCausal Representation Transfer

Official implementation of "Motif-based Graph Self-Supervised Learning forMolecular Property Prediction"

This repository contains the code for: RerrFact model for SciVer shared task

Ipython notebook presentations for getting starting with basic programming, statistics and machine learning techniques

The code of paper 'Learning to Aggregate and Personalize 3D Face from In-the-Wild Photo Collection'

Here we present the implementation in TensorFlow of our work about liver lesion segmentation accepted in the Machine Learning 4 Health Workshop

G-NIA model from "Single Node Injection Attack against Graph Neural Networks" (CIKM 2021)

graph-theoretic framework for robust pairwise data association

Learn other languages ​​using artificial intelligence with python.

Generalized Decision Transformer for Offline Hindsight Information Matching

Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021

Library for 8-bit optimizers and quantization routines.

Autoregressive Models in PyTorch.

The official repository for "Revealing unforeseen diagnostic image features with deep learning by detecting cardiovascular diseases from apical four-chamber ultrasounds"

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

A python bot to move your mouse every few seconds to appear active on Skype, Teams or Zoom as you go AFK. 🐭 🤖

Classification models 1D Zoo - Keras and TF.Keras

The official PyTorch code for 'DER: Dynamically Expandable Representation for Class Incremental Learning' accepted by CVPR2021

MMDetection3D is an open source object detection toolbox based on PyTorch

Learn other languages using artificial intelligence with python.