当前位置:网站首页>[point cloud series] foldingnet: point cloud auto encoder via deep grid deformation
[point cloud series] foldingnet: point cloud auto encoder via deep grid deformation
2022-04-23 13:18:00 【^_^ Min Fei】
List of articles
Inventory clearing series
1. Summary
subject :FoldingNet: Point Cloud Auto encoder via Deep Grid Deformation (CVPR’18 spotlight)
The paper :https://openaccess.thecvf.com/content_cvpr_2018/papers/Yang_FoldingNet_Point_Cloud_CVPR_2018_paper.pdf
Supplementary materials :https://openaccess.thecvf.com/content_cvpr_2018/Supplemental/1129-supp.pdf
Code :https://www.merl.com/research/license#FoldingNet
sketch : be based on 2D The idea of paper folding came into being 3D object
Follow up extension :
- FoldingNet++
- Real-time Soft Robot 3D Proprioception via Deep Vision based Sensing
Similar work :
AtlasNet, It is the use of multiple grid Block to initialize 2D grid.
2. motivation
Can neural networks learn how to fold paper ?
3D Most of the point cloud data comes from the small area of the object surface, which can be regarded as 2D Fluid , Can pass 2D After a series of transformations, we get .
3. thought
The overall framework : Main design decoder
Mainly designed the decoder ,Encoder Use it directly PointNet Partial and simple graph ideas .
simply , Is to copy the hidden code N Share , Then with 2D The mesh is spliced , formation Nx(D+K) Hidden code of dimension , Obtain the point cloud with location information through learning .
In fact, it can be simply understood as a definition of Geometry : High dimensional data in nature can be expressed as low dimensional nonlinear manifolds . Here we use this idea to think of the three-dimensional point cloud as a two-dimensional manifold , Then a simple grid is used to simulate in two dimensions . So it is limited in that it can only simulate 3D When there is no ring in the , Not when there is a ring . That's why there's a back FoldingNet++ The emergence of is used to solve the problem of the emergence of rings .
Graph based encoder Encoder
Encoder = Multilayer perceptron + Graph based maximum pooling layer , The two are spliced together to form an encoder ;
The composition of the picture :16-KNN, For each point , The local covariance matrix of the calculator , Use 3 × 3 3\times3 3×3 Nuclear computing , Then vectorize it into 1 × 9 1\times9 1×9, So the input n × 3 n\times3 n×3–> n × 9 n\times9 n×9. The combination of the two is n × 12 n\times 12 n×12.
hypothesis KNN The adjacency matrix of a graph is A, Input is X. Then the output matrix is formula (2), among K Is the characteristic mapping matrix , Each input is a formula (3). The formula (3) Calculate local feature representation , Topological certainty .
be based on Folding The decoder of Decoder
decoder : Use 2 A continuous 3 Layer perceptron + 2D grid.
Input : Copy the m The characteristics of a share , Each feature 512 Dimension comes from encoder + Copy the m Share of 2D grid, each 2 Whitman's sign = m × 514 m \times 514 m×514.
there grid Use a square ,m=2025, Enter the number of points n=2048.
So what is it called Folding Operation? ?
This is the operation of the whole decoder . That is to say , Copied codewords and 2D grid The stitching of a point-based multilayer perceptron is called Folding operation .
The role of using two three-layer perceptrons ?
- The first is from 2D grid To 3D Spatial Folding operation ;
- The second is in 3D Space only Folding operation , Produce the final surface.
The theoretical analysis :
Assume 2Dgrid The input is a matrix U U U, The codeword output by the encoder is θ \theta θ, Every behavior of the matrix u i u_i ui
Through splicing and MLP after , It can be seen as f ( [ u i , θ ] ) f([u_i, \theta]) f([ui,θ]), The formula can be regarded as through codeword θ \theta θ Re parameterization of high-dimensional functions , because MLP It can be close to nonlinearity , Naturally, you can folding operation .
4. experimental result
Visual results of training :
Shape interpolation :
6. Reference resources
版权声明
本文为[^_^ Min Fei]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/04/202204230611136529.html
边栏推荐
- 在 pytorch 中加载和使用图像分类数据集 Fashion-MNIST
- The first lesson is canvas, showing a small case
- Is Hongmeng system plagiarism? Or the future? Professional explanation that can be understood after listening in 3 minutes
- 基于uniapp异步封装接口请求简介
- Analysis of the latest Android high frequency interview questions in 2020 (BAT TMD JD Xiaomi)
- [wechat applet] flex layout usage record
- 2020最新Android大厂高频面试题解析大全(BAT TMD JD 小米)
- Nodejs + websocket cycle small case
- 【微信小程序】flex布局使用记录
- 「玩转Lighthouse」轻量应用服务器自建DNS解析服务器
猜你喜欢
How to build a line of code with M4 qprotex
The difference between string and character array in C language
FatFs FAT32 learning notes
three. JS text ambiguity problem
AUTOSAR from introduction to mastery 100 lectures (52) - diagnosis and communication management function unit
MySQL -- 16. Data structure of index
100 GIS practical application cases (52) - how to keep the number of rows and columns consistent and aligned when cutting grids with grids in ArcGIS?
MySQL 8.0.11下载、安装和使用可视化工具连接教程
Esp32 vhci architecture sets scan mode for traditional Bluetooth, so that the device can be searched
EMMC / SD learning notes
随机推荐
How to build a line of code with M4 qprotex
CSDN高校俱乐部“名师高校行”——湖南师范大学站
Async void caused the program to crash
【动态规划】221. 最大正方形
MySQL basic statement query
[official announcement] Changsha software talent training base was established!
mui picker和下拉刷新冲突问题
Common interview questions and detailed analysis of the latest Android developers in 2020
Imx6ull QEMU bare metal tutorial 1: GPIO, iomux, I2C
nodejs + mysql 实现简单注册功能(小demo)
The difference between string and character array in C language
Esp32 vhci architecture sets scan mode for traditional Bluetooth, so that the device can be searched
Mui close other pages and keep only the first page
数据仓库—什么是OLAP
SHA512 / 384 principle and C language implementation (with source code)
Mui wechat payment pit
Riscv MMU overview
Translation of multi modal visual tracking: review and empirical comparison
普通大学生如何拿到大厂offer?敖丙教你一招致胜!
RTOS mainstream assessment