当前位置:网站首页>[paper reading] [3D object detection] voxel set transformer: a set to set approach to 3D object detection from point clouds
[paper reading] [3D object detection] voxel set transformer: a set to set approach to 3D object detection from point clouds
2022-04-22 08:33:00 【Lukas88664】
Paper title :Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds
cvpr2022
transformer It has gradually become a new trend to use it on point clouds This article is to use transformer do 3d object detection .
The author first analyzes some existing methods Directly point by point on the point cloud transformer It doesn't work Because it's so big And some existing methods For example, add a point cloud group Do later trans It's hard to avoid missing some points Convert the point cloud to voxel Conduct 3d Convolution words be relative to transformer Come on The receptive field is very small . that Is there a way to enjoy trans Bring global features It can reduce the amount of calculation This article is written in this context :
Old rules Upper figure !

The article is in point level do You can see The author puts forward a relatively new point in the article vsa layer , Let's take a closer look at this structure :
This structure is for all point clouds Firstly, the author introduces set transformer The concept of , It is suggested that you can browse a little before reading this article nips An article on :《Set transformer: A framework for attention-based permutation-invariant neural networks》
The main idea is similar to linformer I think the self attention matrix is a low rank module So for the self attention module We can use two cross attetion To reduce the rank , And then reduce the rank of attention matrix Multiply with our input matrix .

And for all point clouds We'll start from here , First, map the point cloud to key and value Let's do a linear mapping operation And then calculate key And a reduced rank matrix L(K*D) Between cross attention And again value Multiply after softmax Calculation value After the weight of value Multiply obtain hidden feature This hidden feature For every point We put them according to voxel The division of is assigned to each voxel in Make up each voxel Of feature depth :


To constitute voxel chart We do deep convolution Extract different voxel Characteristics between This convolution is done twice Last but not least broadcast What will be obtained feature Project back to the original point For newly acquired feature We call them key and value Perform the similar with the low rank matrix crossattention After the operation Get new point feature

We can see Ahead encoder It's actually vfe The operation of

The above modules That is the focus of the full text vsa
Then the module is superimposed Keep extracting pointlevel The overall characteristics of The use of soft pool Project features onto bev

X Is that the bev voxel Characteristics of points within
The last part 2d Convolution Output detected head

The author believes that the network can also be extended to two stages The second stage The author makes use of cvpr2021 Of Lidar rcnn As refinement net


ablation The experiment made a replacement ffn And different latent codes And in pointrcnn Replace above Sa by transformer Here's how it works :

You can see The important thing is Conduct voxel Between the conv Can greatly improve ap Compare with the original sa transformer It can also better extract local features .
版权声明
本文为[Lukas88664]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/04/202204220728320256.html
边栏推荐
- 链表打印(链表反转输出)
- One question per day: improve the mold assembly on the 15th day of sprint in the big factory
- 反转一个链表<难度系数>
- Nacos Foundation (4): configure the external database of Nacos
- 第3关:节点状态检查、数据查看和更新
- Make the airtest ide into a script and make the script run
- MaterialApp
- 广城云服务实现每天定时填写一日一报
- 重启redis
- 617. 合并二叉树(Easy)
猜你喜欢
随机推荐
C的动态内存管理
Nacos Foundation (4): configure the external database of Nacos
Flutter基础
111. 二叉树的最小深度
构造函数与toString
ospf四类,五类和七类LSA详解
Machine learning notes - Mathematics in principal component analysis
ViewPager用法详细解析
第1关:继承
cesium中实现楼层分解动画
Flutter GridView Demo
oracle数据库表空间容量查询及扩容
链表打印(链表反转输出)
Asnotracking for efcore optimization
只有服务器,没有域名,怎么部署网站?
图片旋转
Flutter ListView 加载更多
用OnLayoutChangeListener的方法解决getTop=0的问题
Nacos Foundation (1): what is configuration center & introduction to Nacos
累加器









