Implementation of the "PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences" paper.

Last update: Dec 09, 2022

Overview

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences

Introduction

Point cloud sequences are irregular and unordered in the spatial dimension while exhibiting regularities and order in the temporal dimension. Therefore, existing grid based convolutions for conventional video processing cannot be directly applied to spatio-temporal modeling of raw point cloud sequences. In the paper, we propose a point spatio-temporal (PST) convolution to achieve informative representations of point cloud sequences. The proposed PST convolution first disentangles space and time in point cloud sequences. Then, a spatial convolution is employed to capture the local structure of points in the 3D space, and a temporal convolution is used to model the dynamics of the spatial regions along the time dimension. Furthermore, we incorporate the proposed PST convolution into a deep network, namely PSTNet, to extract features of 3D point cloud sequences in a spatio-temporally hierarchical manner.

Installation

The code is tested with Red Hat Enterprise Linux Workstation release 7.7 (Maipo), g++ (GCC) 8.3.1, PyTorch v1.2, CUDA 10.2 and cuDNN v7.6.

Install PyTorch v1.2:

pip install torch==1.2.0 torchvision==0.4.0

Compile the CUDA layers for PointNet++, which we used for furthest point sampling (FPS) and radius neighbouring search:

cd modules
python setup.py install

To see if the compilation is successful, try to run python modules/pst_convolutions.py to see if a forward pass works.

Install Mayavi for point cloud visualization (optional). Desktop is required.

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{fan2021pstnet,
    title={PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences},
    author={Hehe Fan and Xin Yu and Yuhang Ding and Yi Yang and Mohan Kankanhalli},
    booktitle={International Conference on Learning Representations},
    year={2021}
}

Related Repos

PointNet++ PyTorch implementation: https://github.com/facebookresearch/votenet/tree/master/pointnet2
MeteorNet: https://github.com/xingyul/meteornet
3DV: https://github.com/3huo/3DV-Action

Implementation of the "PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences" paper.

Related tags

Overview

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences

Introduction

Installation

Citation

Related Repos

Owner

Hehe Fan

Code for 2021 NeurIPS --- Towards Multi-Grained Explainability for Graph Neural Networks

Experiments for Fake News explainability project

Quantify the difference between two arbitrary curves in space

RoadMap and preparation material for Machine Learning and Data Science - From beginner to expert.

Meta Learning for Semi-Supervised Few-Shot Classification

Code for our paper: Online Variational Filtering and Parameter Learning

Image Lowpoly based on Centroid Voronoi Diagram via python-opencv and taichi

The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color

Final project code: Implementing MAE with downscaled encoders and datasets, for ESE546 FA21 at University of Pennsylvania

FlowTorch is a PyTorch library for learning and sampling from complex probability distributions using a class of methods called Normalizing Flows

Learning Calibrated-Guidance for Object Detection in Aerial Images

Learning Versatile Neural Architectures by Propagating Network Codes

Paddle pit - Rethinking Spatial Dimensions of Vision Transformers

Codes for TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.

Graph Representation Learning via Graphical Mutual Information Maximization

The codes and related files to reproduce the results for Image Similarity Challenge Track 2.

Imagededup - 😎 Finding duplicate images made easy

Deep Semisupervised Multiview Learning With Increasing Views (IEEE TCYB 2021, PyTorch Code)

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

GNN-based Recommendation Benchma