End-To-End Optimization of LiDAR Beam Configuration

Last update: Nov 28, 2022

Related tags

Overview

End-To-End Optimization of LiDAR Beam Configuration

This repository is the official implementation of the paper:

End-To-End Optimization of LiDAR Beam Configuration for 3D Object Detection and Localization

Niclas Vödisch, Ozan Unal, Ke Li, Luc Van Gool, and Dengxin Dai.

To appear in RA-L.

If you find our work useful, please consider citing our paper:

to be added after publication

📔 Abstract

Pre-determined beam configurations of low-resolution LiDARs are task-agnostic, hence simply using can result in non-optimal performance. In this work, we propose to optimize the beam distribution for a given target task via a reinforcement learning-based learning-to-optimize (RL-L2O) framework. We design our method in an end-to-end fashion leveraging the final performance of the task to guide the search process. Due to the simplicity of our approach, our work can be integrated with any LiDAR-based application as a simple drop-in module. In this repository, we provide the code for the exemplary task of 3D object detection.

🏗️ ️ Setup

To clone this repository and all submodules run:

git clone --recurse-submodules -j8 [email protected]:vniclas/lidar_beam_selection.git

⚙️ Installation

To install this code, please follow the steps below:

Create a conda environment: conda create -n beam_selection python=3.8
Activate the environment: conda activate beam_selection
Install dependencies: pip install -r requirements.txt
Install cudatoolkit (change to the used CUDA version):
conda install cudnn cudatoolkit=10.2
Install spconv (change to the used CUDA version):
pip install spconv-cu102
Install OpenPCDet (linked as submodule):
cd third_party/OpenPCDet && python setup.py develop && cd ../..
Install Pseudo-LiDAR++ (linked as submodule):
pip install -r third_party/Pseudo_Lidar_V2/requirements.txt
pip install pillow==8.3.2 (avoid runtime warnings)

💾 Data Preparation

Download KITTI 3D Object Detection dataset and extract the files:
1. Left color images image_2
2. Right color images image_3
3. Velodyne point clouds velodyne
4. Camera calibration matrices calib
5. Training labels label_2

Predict the depth maps:

Download pretrained model (training+validation)
Generate the data:

cd third_party/Pseudo_Lidar_V2  
python ./src/main.py -c src/configs/sdn_kitti_train.config \
--resume PATH_TO_CHECKPOINTS/sdn_kitti_object_trainval.pth --datapath PATH_TO_KITTI/training/ \
--data_list ./split/trainval.txt --generate_depth_map --data_tag trainval \
--save_path PATH_TO_DATA/sdn_kitti_train_set

Note: Please adjust the paths PATH_TO_CHECKPOINTS, PATH_TO_KITTI, and PATH_TO_DATA to match your setup.

Rename training/velodyne to training/velodyne_original
Symlink the KITTI folders to PCDet:
- ln -s PATH_TO_KITTI/training third_party/OpenPCDet/data/kitti/training
- ln -s PATH_TO_KITTI/testing third_party/OpenPCDet/data/kitti/testing

🏃 Running 3D Object Detection

Adjust paths in main.py. Further available parameters are listed in rl_l2o/eps_greedy_search.py and can be added in main.py.
Adjust the number of epochs of the 3D object detector in (we used 40 epochs):
- object_detection/compute_reward.py --> above the class definition
- third_party/OpenPCDet/tools/cfgs/kitti_models/pointpillar.yaml --> search for NUM_EPOCHS
  Note: If you use another detector, modify the respective configuration file.
Adjust the training scripts of the utilized detector to match your setup, e.g., object_detection/scripts/train_pointpillar.sh.
Initiate the search: python main.py
Note: Since we keep intermediate results to easily re-use them in later iterations, running the script will create a lot of data in the output_dir specified in main.py. You might want to manually delete some folders from time to time.

🔧 Adding more Tasks

Due to the design of the RL-L2O framework, it can be used as a simple drop-in module for many LiDAR applications. To apply the search algorithm to another task, just implement a custom RewardComputer, e.g., see object_detection/compute_reward.py. Additionally, you will have to prepare a set of features for each LiDAR beam. For the KITTI 3D Object Detection dataset, we provide the features as presented in the paper in object_detection/data/features_pcl.pkl.

👩‍⚖️ License

This software is made available for non-commercial use under a Creative Commons Attribution-NonCommercial 4.0 International License. A summary of the license can be found on the Creative Commons website.

End-To-End Optimization of LiDAR Beam Configuration

Related tags

Overview

End-To-End Optimization of LiDAR Beam Configuration

📔 Abstract

🏗️ ️ Setup

⚙️ Installation

💾 Data Preparation

🏃 Running 3D Object Detection

🔧 Adding more Tasks

👩‍⚖️ License

Owner

Niclas

A Python package for causal inference using Synthetic Controls

Methods to get the probability of a changepoint in a time series.

Locationinfo - A script helps the user to show network information such as ip address

MWPToolkit is a PyTorch-based toolkit for Math Word Problem (MWP) solving.

FedTorch is an open-source Python package for distributed and federated training of machine learning models using PyTorch distributed API

Fully Convolutional DenseNet (A.K.A 100 layer tiramisu) for semantic segmentation of images implemented in TensorFlow.

VLG-Net: Video-Language Graph Matching Networks for Video Grounding

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

Machine learning, in numpy

Out of Distribution Detection on Natural Adversarial Examples

Runtime type annotations for the shape, dtype etc. of PyTorch Tensors.

Pytorch implementation for "Implicit Semantic Response Alignment for Partial Domain Adaptation"

Office source code of paper UniFuse: Unidirectional Fusion for 360$^\circ$ Panorama Depth Estimation

Koç University deep learning framework.

nextPARS, a novel Illumina-based implementation of in-vitro parallel probing of RNA structures.

An Image compression simulator that uses Source Extractor and Monte Carlo methods to examine the post compressive effects different compression algorithms have.

Probabilistic Entity Representation Model for Reasoning over Knowledge Graphs

Code repo for "Cross-Scale Internal Graph Neural Network for Image Super-Resolution" (NeurIPS'20)

rastrainer is a QGIS plugin to training remote sensing semantic segmentation model based on PaddlePaddle.

Direct design of biquad filter cascades with deep learning by sampling random polynomials.