当前位置:网站首页>[point cloud series] foldingnet: point cloud auto encoder via deep grid deformation
[point cloud series] foldingnet: point cloud auto encoder via deep grid deformation
2022-04-23 13:18:00 【^_^ Min Fei】
List of articles
Inventory clearing series
1. Summary
subject :FoldingNet: Point Cloud Auto encoder via Deep Grid Deformation (CVPR’18 spotlight)
The paper :https://openaccess.thecvf.com/content_cvpr_2018/papers/Yang_FoldingNet_Point_Cloud_CVPR_2018_paper.pdf
Supplementary materials :https://openaccess.thecvf.com/content_cvpr_2018/Supplemental/1129-supp.pdf
Code :https://www.merl.com/research/license#FoldingNet
sketch : be based on 2D The idea of paper folding came into being 3D object
Follow up extension :
- FoldingNet++
- Real-time Soft Robot 3D Proprioception via Deep Vision based Sensing
Similar work :
AtlasNet, It is the use of multiple grid Block to initialize 2D grid.
2. motivation
Can neural networks learn how to fold paper ?
3D Most of the point cloud data comes from the small area of the object surface, which can be regarded as 2D Fluid , Can pass 2D After a series of transformations, we get .

3. thought
The overall framework : Main design decoder
Mainly designed the decoder ,Encoder Use it directly PointNet Partial and simple graph ideas .

simply , Is to copy the hidden code N Share , Then with 2D The mesh is spliced , formation Nx(D+K) Hidden code of dimension , Obtain the point cloud with location information through learning .
In fact, it can be simply understood as a definition of Geometry : High dimensional data in nature can be expressed as low dimensional nonlinear manifolds . Here we use this idea to think of the three-dimensional point cloud as a two-dimensional manifold , Then a simple grid is used to simulate in two dimensions . So it is limited in that it can only simulate 3D When there is no ring in the , Not when there is a ring . That's why there's a back FoldingNet++ The emergence of is used to solve the problem of the emergence of rings .

Graph based encoder Encoder

Encoder = Multilayer perceptron + Graph based maximum pooling layer , The two are spliced together to form an encoder ;
The composition of the picture :16-KNN, For each point , The local covariance matrix of the calculator , Use 3 × 3 3\times3 3×3 Nuclear computing , Then vectorize it into 1 × 9 1\times9 1×9, So the input n × 3 n\times3 n×3–> n × 9 n\times9 n×9. The combination of the two is n × 12 n\times 12 n×12.
hypothesis KNN The adjacency matrix of a graph is A, Input is X. Then the output matrix is formula (2), among K Is the characteristic mapping matrix , Each input is a formula (3). The formula (3) Calculate local feature representation , Topological certainty .


be based on Folding The decoder of Decoder


decoder : Use 2 A continuous 3 Layer perceptron + 2D grid.
Input : Copy the m The characteristics of a share , Each feature 512 Dimension comes from encoder + Copy the m Share of 2D grid, each 2 Whitman's sign = m × 514 m \times 514 m×514.
there grid Use a square ,m=2025, Enter the number of points n=2048.
So what is it called Folding Operation? ?
This is the operation of the whole decoder . That is to say , Copied codewords and 2D grid The stitching of a point-based multilayer perceptron is called Folding operation .
The role of using two three-layer perceptrons ?
- The first is from 2D grid To 3D Spatial Folding operation ;
- The second is in 3D Space only Folding operation , Produce the final surface.
The theoretical analysis :
Assume 2Dgrid The input is a matrix U U U, The codeword output by the encoder is θ \theta θ, Every behavior of the matrix u i u_i ui
Through splicing and MLP after , It can be seen as f ( [ u i , θ ] ) f([u_i, \theta]) f([ui,θ]), The formula can be regarded as through codeword θ \theta θ Re parameterization of high-dimensional functions , because MLP It can be close to nonlinearity , Naturally, you can folding operation .
4. experimental result
Visual results of training :

Shape interpolation :


6. Reference resources
版权声明
本文为[^_^ Min Fei]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/04/202204230611136529.html
边栏推荐
- AUTOSAR from introduction to mastery 100 lectures (50) - AUTOSAR memory management series - ECU abstraction layer and MCAL layer
- “湘见”技术沙龙 | 程序员&CSDN的进阶之路
- Playwright contrôle l'ouverture de la navigation Google locale et télécharge des fichiers
- Three channel ultrasonic ranging system based on 51 single chip microcomputer (timer ranging)
- torch. Where can transfer gradient
- Filter and listener of three web components
- 5 tricky activity life cycle interview questions. After learning, go and hang the interviewer!
- Design of body fat detection system based on 51 single chip microcomputer (51 + OLED + hx711 + US100)
- Metalama简介4.使用Fabric操作项目或命名空间
- Imx6ull QEMU bare metal tutorial 1: GPIO, iomux, I2C
猜你喜欢

CMSIS cm3 source code annotation

Solve the problem of Oracle Chinese garbled code

Use Proteus to simulate STM32 ultrasonic srf04 ranging! Code+Proteus

AUTOSAR from introduction to mastery 100 lectures (51) - AUTOSAR network management

Loading and using image classification dataset fashion MNIST in pytorch

Complete project data of UAV apriltag dynamic tracking landing based on openmv (LabVIEW + openmv + apriltag + punctual atom four axes)

9419 page analysis of the latest first-line Internet Android interview questions

Kernel error: no rule to make target 'Debian / canonical certs pem‘, needed by ‘certs/x509_ certificate_ list‘

EMMC / SD learning notes
![[wechat applet] flex layout usage record](/img/ab/7b2392688d8a0130e671f179e09dce.png)
[wechat applet] flex layout usage record
随机推荐
These vscode extensions are my favorite
AUTOSAR from introduction to mastery lecture 100 (84) - Summary of UDS time parameters
@Excellent you! CSDN College Club President Recruitment!
9419 page analysis of the latest first-line Internet Android interview questions
Lpddr4 notes
100 GIS practical application cases (51) - a method for calculating the hourly spatial average of NC files according to the specified range in ArcGIS
Mui wechat payment pit
[notes de marche]
Introduction to metalama 4 Use fabric to manipulate items or namespaces
Nodejs + Mysql realize simple registration function (small demo)
ESP32 VHCI架构传统蓝牙设置scan mode,让设备能被搜索到
Design and manufacture of 51 single chip microcomputer solar charging treasure with low voltage alarm (complete code data)
解决虚拟机中Oracle每次要设置ip的问题
9419页最新一线互联网Android面试题解析大全
POM of SSM integration xml
Common interview questions and detailed analysis of the latest Android developers in 2020
Esp32 vhci architecture sets scan mode for traditional Bluetooth, so that the device can be searched
Conflict between Mui picker and drop-down refresh
You and the 42W bonus pool are one short of the "Changsha bank Cup" Tencent yunqi innovation competition!
Hbuilderx + uniapp packaging IPA submission app store stepping on the pit