Object DGCNN & DETR3D

This repo contains the implementations of Object DGCNN (https://arxiv.org/abs/2110.06923) and DETR3D (https://arxiv.org/abs/2110.06922). Our implementations are built on top of MMdetection3D.

Prerequisite

mmcv (https://github.com/open-mmlab/mmcv)
mmdet (https://github.com/open-mmlab/mmdetection)
mmseg (https://github.com/open-mmlab/mmsegmentation)
mmdet3d (https://github.com/open-mmlab/mmdetection3d)

Data

Follow the mmdet3d to process the data.

Train

Downloads the pretrained backbone weights to pretrained/
For example, to train Object-DGCNN with pillar on 8 GPUs, please use

tools/dist_train.sh projects/configs/obj_dgcnn/pillar.py 8

Evaluation using pretrained models

Download the weights accordingly.

Backbone	mAP	NDS	Download
DETR3D, ResNet101 w/ DCN	34.7	42.2	model \| log
above, + CBGS	34.9	43.4	model \| log
DETR3D, VoVNet on trainval, evaluation on test set	41.2	47.9	model \| log

Backbone	mAP	NDS	Download
Object DGCNN, pillar	53.2	62.8	model \| log
Object DGCNN, voxel	58.6	66.0	model \| log

To test, use
tools/dist_test.sh projects/configs/obj_dgcnn/pillar_cosine.py /path/to/ckpt 8 --eval=bbox

If you find this repo useful for your research, please consider citing the papers

@inproceedings{
   obj-dgcnn,
   title={Object DGCNN: 3D Object Detection using Dynamic Graphs},
   author={Wang, Yue and Solomon, Justin M.},
   booktitle={2021 Conference on Neural Information Processing Systems ({NeurIPS})},
   year={2021}
}

@inproceedings{
   detr3d,
   title={DETR3D: 3D Object Detection from Multi-view Images via 3D-to-2D Queries},
   author={Wang, Yue and Guizilini, Vitor and Zhang, Tianyuan and Wang, Yilun and Zhao, Hang and and Solomon, Justin M.},
   booktitle={The Conference on Robot Learning ({CoRL})},
   year={2021}
}

Object DGCNN and DETR3D, Our implementations are built on top of MMdetection3D.

Related tags

Overview

Object DGCNN & DETR3D

Prerequisite

Data

Train

Evaluation using pretrained models

Owner

Wang, Yue

Weighing Counts: Sequential Crowd Counting by Reinforcement Learning

Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.

Bachelor's Thesis in Computer Science: Privacy-Preserving Federated Learning Applied to Decentralized Data

Diverse Object-Scene Compositions For Zero-Shot Action Recognition

Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)

No-Reference Image Quality Assessment via Transformers, Relative Ranking, and Self-Consistency

Saliency - Framework-agnostic implementation for state-of-the-art saliency methods (XRAI, BlurIG, SmoothGrad, and more).

Crossover Learning for Fast Online Video Instance Segmentation (ICCV 2021)

Confident Semantic Ranking Loss for Part Parsing

Streamlit Tutorial (ex: stock price dashboard, cartoon-stylegan, vqgan-clip, stylemixing, styleclip, sefa)

U-Net: Convolutional Networks for Biomedical Image Segmentation

Leibniz is a python package which provide facilities to express learnable partial differential equations with PyTorch

Canonical Appearance Transformations

Implementation of our recent paper, WOOD: Wasserstein-based Out-of-Distribution Detection.

Supervised 3D Pre-training on Large-scale 2D Natural Image Datasets for 3D Medical Image Analysis

Can we visualize a large scientific data set with a surrogate model? We're building a GAN for the Earth's Mantle Convection data set to see if we can!

DCSL - Generalizable Crowd Counting via Diverse Context Style Learning

This is the accompanying toolbox for the paper "A Survey on GANs for Anomaly Detection"

Machine Learning Platform for Kubernetes