Awesome Monocular 3D detection

Overview

Awesome Monocular 3D detection

Paper list of 3D detetction, keep updating!

Contents

Paper List

2022

  • [MonoDistill] MonoDistill: Learning Spatial Features for Monocular 3D Object Detection [ICLR2022][Pytorch]
  • [MonoCon] Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection [AAAI2022][Pytorch]
  • [ImVoxelNet] ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection [WACV2022][Pytorch]

2021

  • [PCT] Progressive Coordinate Transforms for Monocular 3D Object Detection [NeurIPS2021][Pytorch]
  • [DFR-Net] The Devil Is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection [ICCV2021]
  • [AutoShape] AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection [ICCV2021][Pytorch][Paddle]
  • [pseudo-analysis] Are we Missing Confidence in Pseudo-LiDAR Methods for Monocular 3D Object Detection? [ICCV2021]
  • [Gated3D] Gated3D: Monocular 3D Object Detection From Temporal Illumination Cues [ICCV2021]
  • [MonoRCNN] Geometry-based Distance Decomposition for Monocular 3D Object Detection [ICCV2021][Pytorch]
  • [DD3D] Is Pseudo-Lidar needed for Monocular 3D Object detection [ICCV2021][Pytorch]
  • [GUPNet] Geometry Uncertainty Projection Network for Monocular 3D Object Detection [ICCV2021][Pytorch]
  • [Neighbor-Vote] Neighbor-Vote: Improving Monocular 3D Object Detection through Neighbor Distance Voting [ACMMM2021]
  • [MonoEF] Monocular 3D Object Detection: An Extrinsic Parameter Free Approach [CVPR2021][Pytorch]
  • [monodle] Delving into Localization Errors for Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [Monoflex] Objects are Different: Flexible Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [GrooMeD-NMS] GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [DDMP-3D] Depth-conditioned Dynamic Message Propagation for Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [MonoRUn] MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation [CVPR2021][Pytorch]
  • [M3DSSD] M3DSSD: Monocular 3D Single Stage Object Detector [CVPR2021][Pytorch]
  • [CaDDN] Categorical Depth Distribution Network for Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [visualDet3D] Ground-aware Monocular 3D Object Detection for Autonomous Driving [RA-L][Pytorch]

2020

  • [UR3D] Distance-Normalized Unified Representation for Monocular 3D Object Detection [ECCV2020]
  • [MonoDR] Monocular Differentiable Rendering for Self-Supervised 3D Object Detection [ECCV2020]
  • [DA-3Ddet] Monocular 3d object detection via feature domain adaptation [ECCV2020]
  • [MoVi-3D] Towards generalization across depth for monocular 3d object detection [ECCV2020]
  • [PatchNet] Rethinking Pseudo-LiDAR Representation [ECCV2020][Pytorch]
  • [RAR-Net] Reinforced Axial Refinement Network for Monocular 3D Object Detection [ECCV2020]
  • [kinematic3d] Kinematic 3D Object Detection in Monocular Video [ECCV2020][Pytorch]
  • [RTM3D] RTM3D: Real-time Monocular 3D Detection from Object Keypoints for Autonomous Driving [ECCV2020][Pytorch]
  • [SMOKE] SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation [CVPRW2020][Pytorch]
  • [D4LCN] Learning Depth-Guided Convolutions for Monocular 3D Object Detection [CVPRW2020][Pytorch]
  • [MonoPair] MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships [CVPR2020]
  • [pseudo-LiDAR_e2e] End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection [CVPR2020][Pytorch]
  • [Pseudo-LiDAR++] Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving [ICLR2020][Pytorch]
  • [OACV] Object-Aware Centroid Voting for Monocular 3D Object Detection [IROS2020]
  • [MonoGRNet_v2] Monocular 3D Object Detection via Geometric Reasoning on Keypoints [VISIGRAPP2020]
  • [ForeSeE] Task-Aware Monocular Depth Estimation for 3D Object Detection [AAAI2020(oral)][Pytorch]
  • [Decoupled-3D] Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation [AAAI2020]

2019

  • [3d-vehicle-tracking] Joint Monocular 3D Vehicle Detection and Tracking [ICCV2019][Pytorch]
  • [MonoDIS] Disentangling monocular 3d object detection [ICCV2019]
  • [AM3D] Accurate Monocular Object Detection via Color-Embedded 3D Reconstruction for Autonomous Driving [ICCV2019]
  • [M3D-RPN] M3D-RPN: Monocular 3D Region Proposal Network for Object Detection [ICCV2019(Oral)][Pytorch]
  • [MVRA] Multi-View Reprojection Architecture for Orientation Estimation [ICCVW2019]
  • [Mono3DPLiDAR] Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud [ICCVW2019]
  • [MonoPSR] Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction [CVPR2019][Pytorch]
  • [FQNet] Deep fitting degree scoring network for monocular 3d object detection [CVPR2019]
  • [ROI-10D] ROI-10D: Monocular Lifting of 2D Detection to 6D Pose and Metric Shape [CVPR2019]
  • [GS3D] GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving [CVPR2019]
  • [Pseudo-LiDAR] Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving [CVPR2019][Pytorch]
  • [BirdGAN] Learning 2D to 3D Lifting for Object Detection in 3D for Autonomous Vehicles [IROS2019]
  • [MonoGRNet] MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization [AAAI2019(oral)][Tensorflow]
  • [OFT-Net] Orthographic feature transform for monocular 3d object detection [BMVC2019][Pytorch]
  • [Shift R-CNN] Shift R-CNN: Deep Monocular 3D Object Detection with Closed-Form Geometric Constraints [TIP2019]
  • [SS3D] SS3D: Monocular 3d object detection and box fitting trained end-to-end using intersection-over-union loss [Arxiv2019]

2018

  • [Multi-Fusion] Multi-Level Fusion based 3D Object Detection from Monocular Images [CVPR2018][Pytorch]
  • [Mono3D++] Mono3D++: Monocular 3D Vehicle Detection with Two-Scale 3D Hypotheses and Task Priors [AAAI2018]

2017

  • [Deep3DBox] 3D Bounding Box Estimation Using Deep Learning and Geometry [CVPR2017][Pytorch][Tensorflow]
  • [Deep MANTA] Deep MANTA: A Coarse-to-fine Many-Task Network for joint 2D and 3D vehicle analysis from monocular image [CVPR2017]

2016

  • [Mono3D] Monocular 3D object detection for autonomous driving [CVPR2016]

KITTI Results

Method Extra Test, AP3D|R40 Val, AP3D|R40 Val, AP3D|R11 Reference
Easy Mod. Hard Easy Mod. Hard Easy Mod. Hard
MonoRUn Lidar 19.65 12.30 10.58 20.02 14.65 12.61 - - - CVPR2021
CaDDN Lidar 19.17 13.41 11.46 23.57 16.31 13.84 - - - CVPR2021
AM3D Depth 16.50 10.74 9.52 28.31 15.76 12.24 32.23 21.09 17.26 ICCV2019
PatchNet Depth 15.68 11.12 10.17 31.60 16.80 13.80 35.10 22.00 19.60 ECCV2020
D4LCN Depth 16.65 11.72 9.51 22.32 16.20 12.30 26.97 21.72 18.22 CVPRW2020
DFR-Net Depth 19.40 13.63 10.35 24.81 17.78 14.41 28.80 22.88 19.47 ICCV2021
M3D-RPN None 14.76 9.71 7.42 14.53 11.07 8.65 20.27 17.06 15.21 ICCV2019
SMOKE None 14.03 9.76 7.84 - - - 14.76 12.85 11.50 CVPRW2020
MonoPair None 13.04 9.99 8.65 16.28 12.30 10.42 - - - CVPR2020
RTM3D None 14.41 10.34 8.77 - - - 20.77 16.86 16.63 ECCV2020
M3DSSD None 17.51 11.46 8.98 - - - 27.77 21.67 18.28 CVPR2021
Monoflex None 19.94 13.89 12.07 23.64 17.51 14.83 28.17 21.92 19.07 CVPR2021
GUPNet None 20.11 14.20 11.77 22.76 16.46 13.72 - - - ICCV2021
MonoCon None 22.50 16.46 13.95 26.33 19.01 15.98 - - - AAAI2022
Owner
Zhikang Zou
Baidu Inc.
Zhikang Zou
A nutritional label for food for thought.

Lexiscore As a first effort in tackling the theme of information overload in content consumption, I've been working on the lexiscore: a nutritional la

Paul Bricman 34 Nov 08, 2022
code associated with ACL 2021 DExperts paper

DExperts Hi! This repository contains code for the paper DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts to appear at

Alisa Liu 68 Dec 15, 2022
Train DeepLab for Semantic Image Segmentation

Train DeepLab for Semantic Image Segmentation Martin Kersner, [email protected]

Martin Kersner 172 Dec 14, 2022
Gym-TORCS is the reinforcement learning (RL) environment in TORCS domain with OpenAI-gym-like interface.

Gym-TORCS Gym-TORCS is the reinforcement learning (RL) environment in TORCS domain with OpenAI-gym-like interface. TORCS is the open-rource realistic

naoto yoshida 400 Dec 27, 2022
Automatically download the cwru data set, and then divide it into training data set and test data set

Automatically download the cwru data set, and then divide it into training data set and test data set.自动下载cwru数据集,然后分训练数据集和测试数据集

6 Jun 27, 2022
ManiSkill-Learn is a framework for training agents on SAPIEN Open-Source Manipulation Skill Challenge (ManiSkill Challenge), a large-scale learning-from-demonstrations benchmark for object manipulation.

ManiSkill-Learn ManiSkill-Learn is a framework for training agents on SAPIEN Open-Source Manipulation Skill Challenge, a large-scale learning-from-dem

Hao Su's Lab, UCSD 48 Dec 30, 2022
Implementation of self-attention mechanisms for general purpose. Focused on computer vision modules. Ongoing repository.

Self-attention building blocks for computer vision applications in PyTorch Implementation of self attention mechanisms for computer vision in PyTorch

AI Summer 962 Dec 23, 2022
Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

This is a playground for pytorch beginners, which contains predefined models on popular dataset. Currently we support mnist, svhn cifar10, cifar100 st

Aaron Chen 2.4k Dec 28, 2022
Council-GAN - Implementation for our paper Breaking the Cycle - Colleagues are all you need (CVPR 2020)

Council-GAN Implementation of our paper Breaking the Cycle - Colleagues are all you need (CVPR 2020) Paper Ori Nizan , Ayellet Tal, Breaking the Cycle

ori nizan 260 Nov 16, 2022
Segmentation and Identification of Vertebrae in CT Scans using CNN, k-means Clustering and k-NN

Segmentation and Identification of Vertebrae in CT Scans using CNN, k-means Clustering and k-NN If you use this code for your research, please cite ou

41 Dec 08, 2022
Patient-Survival - Using Python, I developed a Machine Learning model using classification techniques such as Random Forest and SVM classifiers to predict a patient's survival status that have undergone breast cancer surgery.

Patient-Survival - Using Python, I developed a Machine Learning model using classification techniques such as Random Forest and SVM classifiers to predict a patient's survival status that have underg

Nafis Ahmed 1 Dec 28, 2021
Cross-Image Region Mining with Region Prototypical Network for Weakly Supervised Segmentation

Cross-Image Region Mining with Region Prototypical Network for Weakly Supervised Segmentation The code of: Cross-Image Region Mining with Region Proto

LiuWeide 16 Nov 26, 2022
Predicting path with preference based on user demonstration using Maximum Entropy Deep Inverse Reinforcement Learning in a continuous environment

Preference-Planning-Deep-IRL Introduction Check my portfolio post Dependencies Gym stable-baselines3 PyTorch Usage Take Demonstration python3 record.

Tianyu Li 9 Oct 26, 2022
level1-image-classification-level1-recsys-09 created by GitHub Classroom

level1-image-classification-level1-recsys-09 ❗ 주제 설명 COVID-19 Pandemic 상황 속 마스크 착용 유무 판단 시스템 구축 마스크 착용 여부, 성별, 나이 총 세가지 기준에 따라 총 18개의 class로 구분하는 모델 ?

6 Mar 17, 2022
PICARD - Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models

This is the official implementation of the following paper: Torsten Scholak, Nathan Schucher, Dzmitry Bahdanau. PICARD - Parsing Incrementally for Con

ElementAI 217 Jan 01, 2023
Text and code for the forthcoming second edition of Think Bayes, by Allen Downey.

Think Bayes 2 by Allen B. Downey The HTML version of this book is here. Think Bayes is an introduction to Bayesian statistics using computational meth

Allen Downey 1.5k Jan 08, 2023
Source code for Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning

Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning Official implementation of ACC, described in the paper "Adaptively Calibrated C

3 Sep 16, 2022
A flag generation AI created using DeepAIs API

Vex AI or Vexiology AI is an Artifical Intelligence created to generate custom made flag design texts. It uses DeepAIs API. Please be aware that you must include your own DeepAI API key. See instruct

Bernie 10 Apr 06, 2022
Using some basic methods to show linkages and transformations of robotic arms

roboticArmVisualizer Python GUI application to create custom linkages and adjust joint angles. In the future, I plan to add 2d inverse kinematics solv

Sandesh Banskota 1 Nov 19, 2021
All materials of Cassandra Event, Udyam'22

Cassandra 2022 Workspace Workshop Materials Workshop-1 Workshop-2 Workshop-3 Workshop-4 Assignments Assignment-1 Assignment-2 Assignment-3 Resources P

36 Dec 31, 2022