Patch-Based Deep Autoencoder for Point Cloud Geometry Compression

Last update: Dec 05, 2022

Related tags

Overview

Patch-Based Deep Autoencoder for Point Cloud Geometry Compression

Overview

The ever-increasing 3D application makes the point cloud compression unprecedentedly important and needed. In this paper, we propose a patch-based compression process using deep learning, focusing on the lossy point cloud geometry compression. Unlike existing point cloud compression networks, which apply feature extraction and reconstruction on the entire point cloud, we divide the point cloud into patches and compress each patch independently. In the decoding process, we finally assemble the decompressed patches into a complete point cloud. In addition, we train our network by a patch-to-patch criterion, i.e., use the local reconstruction loss for optimization, to approximate the global reconstruction optimality. Our method outperforms the state-of-the-art in terms of rate-distortion performance, especially at low bitrates. Moreover, the compression process we proposed can guarantee to generate the same number of points as the input. The network model of this method can be easily applied to other point cloud reconstruction problems, such as upsampling.

Environment

Python 3.9.6 and Pytorch 1.9.0

Other dependencies:

pytorch3d 0.5.0 for KNN and chamfer loss: https://github.com/facebookresearch/pytorch3d

geo_dist for point to plane evaluation: https://github.com/mauriceqch/geo_dist

*For some unexpected reasons, we have rewritten the experimental code using a different environment and dependencies than in the paper. The training parameters and experimental results may be slightly different.

Data Preparation

You need ModelNet40 and ShapeNet to reproduce our results. The following steps will show you a general way to prepare point clouds in our experiment.

ModelNet40

Download the ModelNet40 data: http://modelnet.cs.princeton.edu

Convert CAD models(.off) to point clouds(.ply) by using sample_modelnet.py:

python ./sample_modelnet.py ./data/ModelNet40 ./data/ModelNet40_pc_8192 --n_point 8192

ShapeNet

Download the ShapeNet data here

Sampling point clouds by using sample_shapenet.py:

python ./sample_shapenet.py ./data/shapenetcore_partanno_segmentation_benchmark_v0_normal ./data/ShapeNet_pc_2048 --n_point 2048

Training

We use train_ae.py to train an autoencoder on ModelNet40 dataset:

python ./train_ae.py './data/ModelNet40_pc_8192/**/train/*.ply' './model/trained_128_16' --N 8192 --ALPHA 2 --K 128 --d 16

Compression and Decompression

We use compress.py and decompress.py to perform compress on point clouds using our trained autoencoder. Take the compression of ModelNet40 as an example:

python ./compress.py './model/trained_128_16' './data/ModelNet40_pc_8192/**/test/*.ply' './data/ModelNet40_pc_8192_compressed_128_16' --ALPHA 2

python ./decompress.py './model/trained_128_16' './data/ModelNet40_pc_8192_compressed_128_16' './data/ModelNet40_pc_8192_decompressed_128_16'

Evaluation

The Evaluation process uses the same software geo_dist as in Quach's code. We use eval.py to measure reconstruction quality and check the bitrate of the compressed file.

python ./eval.py ../geo_dist/build/pc_error './data/ModelNet40_pc_8192/**/test/*.ply' './data/ModelNet40_pc_8192_compressed_128_16' './data/ModelNet40_pc_8192_decompressed_128_16' './eval/ModelNet40_128_16.csv'

Patch-Based Deep Autoencoder for Point Cloud Geometry Compression

Related tags

Overview

Patch-Based Deep Autoencoder for Point Cloud Geometry Compression

Overview

Environment

Data Preparation

Training

Compression and Decompression

Evaluation

Owner

Cognition-aware Cognate Detection

[ICCV 2021] Official Tensorflow Implementation for "Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions"

Official implementation for TTT++: When Does Self-supervised Test-time Training Fail or Thrive

Deep Learning Algorithms for Hedging with Frictions

Face Recognition and Emotion Detector Device

High-performance moving least squares material point method (MLS-MPM) solver.

Retinal vessel segmentation based on GT-UNet

More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval

MediaPipe is a an open-source framework from Google for building multimodal

Modelisation on galaxy evolution using PEGASE-HR

Neuron class provides LNU (Linear Neural Unit), QNU (Quadratic Neural Unit), RBF (Radial Basis Function), MLP (Multi Layer Perceptron), MLP-ELM (Multi Layer Perceptron - Extreme Learning Machine) neurons learned with Gradient descent or LeLevenberg–Marquardt algorithm

Non-stationary GP package written from scratch in PyTorch

Context Decoupling Augmentation for Weakly Supervised Semantic Segmentation

The official PyTorch implementation of paper BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition

Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

MoCoGAN: Decomposing Motion and Content for Video Generation

codes for paper Combining Dynamic Local Context Focus and Dependency Cluster Attention for Aspect-level sentiment classification

A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body

Unofficial PyTorch implementation of Guided Dropout