当前位置:网站首页>[point cloud series] relationship based point cloud completion
[point cloud series] relationship based point cloud completion
2022-04-23 13:17:00 【^_^ Min Fei】
List of articles
1. Summary
TVCG 2021 Periodical , Point cloud completion
Address of thesis :https://ieeexplore.ieee.org/abstract/document/9528986
2. motivation
For the scene of multi object combination Partial completion
Pay close attention to : Whether the two objects are close in space in the scene
As shown in the figure : That is, two objects with similar space refer to each other to complete
3. Method
Problem definition :
Given a partial point cloud after segmentation , There are two parts , Belong to different objects
The goal is : Complete each part
Assume : Assume The input is obtained from a scan , This is more challenging than capturing scenes from multiple directions . In theory , Our method can also be used to construct multi-directional scanning .
Two way network :
Why two-way network : In fact, it's because of two possible incomplete objects O a O_a Oa and O b O_b Ob, Let's see which of these objects is completed first , If it's first A Again B So it's the left branch in the figure below , If it's first B Again A So it's the right branch .
Optimize the network by using consistency constraints , I hope the more similar the results of the two branches, the better . It is ensured by the parameter mapping of the graph .
reference :https://consistency.epfl.ch/
This way is also called : Conditional completion .
The definition is as follows :
Overall network framework :
Network based on self encoder :
The short answer is
- Encode all input point cloud features first , Then maximize the pool to get A global feature g 1 g_1 g1
- Then extend g 1 g_1 g1 And contain only the features of the object that need to be completed ( yellow ), Then maximize the pool into global features g 2 g_2 g2
- Last use TopNet To complete what needs to be completed
Loss function :
- Shape loss :EMD
- Loss of consistency : Defined in two different paths EMD distance .
Training process :
- Train first Step1 The Internet
- Training Step2 The Internet ;
- Train the whole network ;
4. experiment
Data sets :
from [29] Build it , That is to say Interaction context (ICON): Towards a geometric functionality
descriptor The data set of this article .
contain 6 Data sets of different combination types , Here's the picture 4:
- desk-chair
- vase-flower
- hanger-clothes
- basket-object
- handcart-object
- stand-hat
Partial scan simulation :
- First, calculate the region of interest , Pictured 4 Shown , Use IBS[28] To extract
- Then after extraction IBS Upper sampling point
- Then, the bounding boxes of these sampling points are calculated as the interactive region of interest .
- These bounding boxes are normalized by coordinates , The visitor's center is regarded as the central point of interaction .
- Then randomly sample different camera positions at the center of the region of interest 1024 A point for each object , Remove the situation where one object obscures another .
Visualization :
experimental result :
Ablation Experiment :
chart 2:
• v1: PCN-encoder + PCN-decoder
• v2: PCN-encoder + topnet-decoder
• v3: Our-encoder + PCN-decoder
• v4 (ours): Our-encoder + topnet-decoder
Yes no Verification of consistency loss :
Robust to noise
5. Spatial relations
This is partly because I am interested in , Pay attention to .
[27] Use the correlation matrix to encode the spatial relationship
[28] Put forward Interaction Bisector Surface (IBS) To capture interactive information , Include Geometric and topological features ;
[29] IBS + IR( Interaction area ) To encode more geometric features .
The above methods are used to complete 3D Of the scene
The following method is to capture the spatial relationship between point clouds :
[30] Figure network , combination 2D and 3D Set information to guide the expression of association relationship
[31] The overlap of the border after mapping Separated by the nearest distance between two objects , Then calculate the spatial correlation .
[32] and [33]: The joint Gaussian distribution is used to express the spatial relationship of different objects in a scene .
[34] Methods based on deep learning , Measure relationships by nearest neighbors
[35] Use the attention mechanism to capture the relationship between a point and its adjacent points .
Corresponding literature :
[27] M. Fisher, D. Ritchie, M. Savva, T. Funkhouser, and P. Hanrahan, “Example-based synthesis of 3d object arrangements,” ACM Trans. Graph., vol. 31, no. 6, pp. 135:1–135:11, 2012.
[28] X. Zhao, H. Wang, and T. Komura, “Indexing 3d scenes using the interaction bisector surface,” ACM Transactions on Graphics (TOG), vol. 33, no. 3, pp. 22:1–22:14, 2014.
[29] R. Hu, C. Zhu, O. van Kaick, L. Liu, A. Shamir, and H. Zhang, “Interaction context (ICON): Towards a geometric functionality descriptor,” ACM Transactions on Graphics, vol. 34, 2015.
[30] X. Qi, R. Liao, J. Jia, S. Fidler, and R. Urtasun, “3d graph neural networks for RGBD semantic segmentation,” in 2017 IEEE International Conference on Computer Vision (ICCV), 2017, pp. 5209–5218, ISSN: 2380-7504.
[31] Y. Song, Z. Sun, Y. Wu, and H. Li, “Coarse-to-fine segmentation for indoor scenes with progressive supervision,” Computer Aided Geometric Design, vol. 75, p. 101775, 2019.
[32] M. Alberti, P. Jensfelt, and J. Folkesson, “Relational approaches for joint object classification andscene similarity measurement in indoor environments,” in AAAI Spring Symposium Qualitative Representations for Robots March 24–26 2014, Palo Alto, USA. The AAAI Press, 2014.
[33] M. Sunkel, S. Jansen, M. Wand, and H.-P. Seidel, “A correlated parts model for object detection in large 3d scans,” in Computer Graphics Forum, vol. 32. Wiley Online Library, 2013, pp. 205–214.
[34] Y. Liu, B. Fan, S. Xiang, and C. Pan, “Relation-shape convolutional neural network for point cloud analysis,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019, pp. 8895–8904.
[35] Z. Li, J. Zhang, G. Li, Y. Liu, and S. Li, “Graph attention neural networks for point cloud recognition,” in 2019 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 2019-07, pp. 387–392.
版权声明
本文为[^_^ Min Fei]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/04/202204230611136919.html
边栏推荐
- These vscode extensions are my favorite
- MySQL -- 16. Data structure of index
- [walking notes]
- According to the salary statistics of programmers in June 2021, the average salary is 15052 yuan. Are you holding back?
- @优秀的你!CSDN高校俱乐部主席招募!
- 51 single chip microcomputer stepping motor control system based on LabVIEW upper computer (upper computer code + lower computer source code + ad schematic + 51 complete development environment)
- Translation of multi modal visual tracking: review and empirical comparison
- Ffmpeg common commands
- Design of STM32 multi-channel temperature measurement wireless transmission alarm system (industrial timing temperature measurement / engine room temperature timing detection, etc.)
- 这几种 VSCode 扩展是我最喜欢的
猜你喜欢
Design of STM32 multi-channel temperature measurement wireless transmission alarm system (industrial timing temperature measurement / engine room temperature timing detection, etc.)
The filter() traverses the array, which is extremely friendly
Nodejs + Mysql realize simple registration function (small demo)
十万大学生都已成为猿粉,你还在等什么?
[dynamic programming] 221 Largest Square
Hbuilderx + uniapp packaging IPA submission app store stepping on the pit
MySQL 8.0.11 download, install and connect tutorials using visualization tools
Esp32 vhci architecture sets scan mode for traditional Bluetooth, so that the device can be searched
Summary of request and response and their ServletContext
100000 college students have become ape powder. What are you waiting for?
随机推荐
叮~ 你的奖学金已到账!C认证企业奖学金名单出炉
Uniapp image import local image not displayed
How do ordinary college students get offers from big factories? Ao Bing teaches you one move to win!
9419页最新一线互联网Android面试题解析大全
[walking notes]
web三大组件之Servlet
AUTOSAR from introduction to mastery lecture 100 (84) - Summary of UDS time parameters
Example interview | sun Guanghao: College Club grows and starts a business with me
Ffmpeg common commands
STM32 tracking based on open MV
Ding ~ your scholarship has arrived! C certified enterprise scholarship list released
“湘见”技术沙龙 | 程序员&CSDN的进阶之路
Nodejs + Mysql realize simple registration function (small demo)
CMSIS cm3 source code annotation
async void 導致程序崩潰
Solve the problem of Oracle Chinese garbled code
Melt reshape decast long data short data length conversion data cleaning row column conversion
POM of SSM integration xml
[notes de marche]
[quick platoon] 215 The kth largest element in the array