Released code for Objects are Different: Flexible Monocular 3D Object Detection, CVPR21

Last update: Dec 06, 2022

Related tags

Deep Learning MonoFlex

Overview

MonoFlex

Released code for Objects are Different: Flexible Monocular 3D Object Detection, CVPR21.

Work in progress.

Installation

This repo is tested with Ubuntu 20.04, python==3.7, pytorch==1.4.0 and cuda==10.1

conda create -n monoflex python=3.7

conda activate monoflex

Install PyTorch and other dependencies:

conda install pytorch==1.4.0 torchvision==0.5.0 cudatoolkit=10.1 -c pytorch

pip install -r requirements.txt

Build DCNv2 and the project

cd models/backbone/DCNv2

. make.sh

cd ../../..

python setup develop

Data Preparation

Please download KITTI dataset and organize the data as follows:

#ROOT		
  |training/
    |calib/
    |image_2/
    |label/
    |ImageSets/
  |testing/
    |calib/
    |image_2/
    |ImageSets/

Then modify the paths in config/paths_catalog.py according to your data path.

Training & Evaluation

Training with one GPU. (TODO: The multi-GPU training will be further tested.)

CUDA_VISIBLE_DEVICES=0 python tools/plain_train_net.py --batch_size 8 --config runs/monoflex.yaml --output output/exp

The model will be evaluated periodically (can be adjusted in the CONFIG) during training and you can also evaluate a checkpoint with

CUDA_VISIBLE_DEVICES=0 python tools/plain_train_net.py --config runs/monoflex.yaml --ckpt YOUR_CKPT  --eval

You can also specify --vis when evaluation to visualize the predicted heatmap and 3D bounding boxes. The pretrained model for train/val split and logs are here.

Note: we observe an obvious variation of the performance for different runs and we are still investigating possible solutions to stablize the results, though it may inevitably due to the utilized uncertainties.

Citation

If you find our work useful in your research, please consider citing:

@InProceedings{MonoFlex,
    author    = {Zhang, Yunpeng and Lu, Jiwen and Zhou, Jie},
    title     = {Objects Are Different: Flexible Monocular 3D Object Detection},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {3289-3298}
}

Acknowlegment

The code is heavily borrowed from SMOKE and thanks for their contribution.

Released code for Objects are Different: Flexible Monocular 3D Object Detection, CVPR21

Related tags

Overview

MonoFlex

Installation

Data Preparation

Training & Evaluation

Citation

Acknowlegment

Owner

Yunpeng

MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution (CVPR2021)

Attentional Focus Modulates Automatic Finger‑tapping Movements

A simple baseline for the 2022 IEEE GRSS Data Fusion Contest (DFC2022)

Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples

On Size-Oriented Long-Tailed Graph Classification of Graph Neural Networks

[ICCV 2021] Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification

In this project, we create and implement a deep learning library from scratch.

EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections

Code for EMNLP2020 long paper: BERT-Attack: Adversarial Attack Against BERT Using BERT

A library for implementing Decentralized Graph Neural Network algorithms.

Web-interface + rest API for classification and regression (https://jeff1evesque.github.io/machine-learning.docs)

A PyTorch implementation of "Multi-Scale Contrastive Siamese Networks for Self-Supervised Graph Representation Learning", IJCAI-21

3D Avatar Lip Syncronization from speech (JALI based face-rigging)

A human-readable PyTorch implementation of "Self-attention Does Not Need O(n^2) Memory"

Semi-Supervised Semantic Segmentation with Cross-Consistency Training (CCT)

Pytorch Implementation of Auto-Compressing Subset Pruning for Semantic Image Segmentation

Repositório da disciplina de APC, no segundo semestre de 2021

Pytorch modules for paralel models with same architecture. Ideal for multi agent-based systems

Generative Art Using Neural Visual Grammars and Dual Encoders

Minimal PyTorch implementation of YOLOv3