[Paper reading] [3D object detection] Point Transformer
2022-04-23 04:36:00 【Lukas88664】
Paper title: Point Transformer
ICCV 2021
The paper applies a transformer to point cloud tasks.
A point cloud is an unordered set of points, and a transformer is well suited to exactly this kind of input.
However, applying a transformer directly to a large-scale point cloud is obviously very expensive, so the authors propose a new form of transformer that only attends to nearby points found with kNN.
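To make the kNN step concrete, here is a minimal brute-force sketch in NumPy. The function name `knn_indices` and the toy points are made up for illustration, and this is not the authors' implementation. It builds the full N x N distance matrix, which is also why neighbor search gets costly on large clouds.

```python
import numpy as np

def knn_indices(points: np.ndarray, k: int) -> np.ndarray:
    """Return the indices of the k nearest neighbours of every point.

    points: (N, 3) array of xyz coordinates.
    Returns an (N, k) integer array; each point counts as its own neighbour.
    """
    # Pairwise squared Euclidean distances, shape (N, N).
    diff = points[:, None, :] - points[None, :, :]
    dist2 = np.sum(diff * diff, axis=-1)
    # Sort each row by distance and keep the k closest indices.
    return np.argsort(dist2, axis=1, kind="stable")[:, :k]


# Toy example (hypothetical data): three points near the origin, one far away.
pts = np.array([[0.0, 0.0, 0.0],
                [1.0, 0.0, 0.0],
                [0.0, 2.0, 0.0],
                [5.0, 5.0, 5.0]])
neighbours = knn_indices(pts, k=2)
print(neighbours)
```

In practice a spatial index (for example a KD-tree) or a GPU kernel would replace the dense distance matrix.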
First, the authors introduce the background of transformers. Self-attention can be divided into scalar attention and vector attention.
Scalar attention is the attention mechanism we usually talk about.
Vector attention instead computes the weights from a relation between the two tokens.
Here β is a relation function (for example, subtraction) and γ is a mapping function.
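For reference, the two forms can be written as below. This is reconstructed from my recollection of the paper's notation, so treat the symbols that do not appear in these notes as assumptions: φ, ψ and α are pointwise feature transformations (such as linear layers), δ is a position encoding, ρ is a normalization such as softmax, and ⊙ is the elementwise product.

```latex
% Scalar attention: one scalar weight per pair (i, j)
y_i = \sum_{x_j \in \mathcal{X}} \rho\!\left( \varphi(x_i)^{\top} \psi(x_j) + \delta \right) \alpha(x_j)

% Vector attention: one weight per feature channel for each pair (i, j)
y_i = \sum_{x_j \in \mathcal{X}} \rho\!\left( \gamma\!\left( \beta\!\left( \varphi(x_i), \psi(x_j) \right) + \delta \right) \right) \odot \alpha(x_j)
```

The practical difference is that scalar attention gives every channel of α(x_j) the same weight, while vector attention can weight each channel separately.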
The point transformer layer proposed by the authors is built mainly on the vector attention module, which considers the relation between two tokens. In addition, the position encoding is added to the value, so the relation between position and value is taken into account as well.
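As far as I recall from the paper, the layer can be written as follows, with subtraction as the relation function β. X(i) denotes the set of k nearest neighbors of x_i, and the remaining symbols (ρ, φ, ψ, α, δ) follow the paper's notation rather than anything defined in these notes.

```latex
y_i = \sum_{x_j \in \mathcal{X}(i)} \rho\!\left( \gamma\!\left( \varphi(x_i) - \psi(x_j) + \delta \right) \right) \odot \left( \alpha(x_j) + \delta \right)
```

Note that δ appears twice: once inside the attention weights and once added to the value.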
Concretely, for an input center point and its neighbor points, the features are first encoded with linear layers and subtracted to obtain their relation, and the position encoding is added. An MLP is applied to the result, which is then normalized to give the weight matrix. The weights are multiplied with the sum of the value and the position encoding, and the result is summed over the k neighbors to give the output feature of the center point. Note that before the transformer is applied, a kNN query selects the k points nearest to the center point, and the value weighting is done only over those k points.
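The steps above can be sketched in NumPy roughly as follows. This is a simplified illustration under my own assumptions, not the authors' code: weights are random, there are no biases or batch normalization, the normalization is taken to be a softmax over the k neighbors, and all names (`linear`, `mlp`, `point_transformer_layer`, the `params` keys) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def linear(in_dim, out_dim):
    """Hypothetical helper: random weights for a bias-free linear layer."""
    return rng.standard_normal((in_dim, out_dim)) / np.sqrt(in_dim)

def mlp(x, w1, w2):
    """Two linear layers with a ReLU in between."""
    return np.maximum(x @ w1, 0.0) @ w2

def point_transformer_layer(xyz, feats, k, params):
    """xyz: (N, 3) coordinates, feats: (N, C) features -> (N, C) features."""
    # 1. kNN query: indices of the k nearest points for every centre point.
    d2 = np.sum((xyz[:, None] - xyz[None]) ** 2, axis=-1)       # (N, N)
    idx = np.argsort(d2, axis=1)[:, :k]                         # (N, k)
    # 2. Linear feature encodings (query, key, value).
    q = feats @ params["phi"]                                   # (N, C)
    key = feats @ params["psi"]                                 # (N, C)
    val = feats @ params["alpha"]                               # (N, C)
    # 3. Position encoding from relative coordinates.
    rel = xyz[:, None, :] - xyz[idx]                            # (N, k, 3)
    delta = mlp(rel, params["theta1"], params["theta2"])        # (N, k, C)
    # 4. Relation by subtraction, plus position encoding, then the mapping MLP.
    relation = q[:, None, :] - key[idx] + delta                 # (N, k, C)
    logits = mlp(relation, params["gamma1"], params["gamma2"])  # (N, k, C)
    # 5. Softmax over the k neighbours, separately for every channel.
    logits = logits - logits.max(axis=1, keepdims=True)
    w = np.exp(logits)
    w = w / w.sum(axis=1, keepdims=True)
    # 6. Weight (value + position encoding) and sum over the neighbours.
    return np.sum(w * (val[idx] + delta), axis=1)               # (N, C)


C = 8
params = {
    "phi": linear(C, C), "psi": linear(C, C), "alpha": linear(C, C),
    "theta1": linear(3, C), "theta2": linear(C, C),
    "gamma1": linear(C, C), "gamma2": linear(C, C),
}
xyz = rng.random((16, 3))
feats = rng.standard_normal((16, C))
out = point_transformer_layer(xyz, feats, k=4, params=params)
print(out.shape)
```

Because every step depends only on each point's neighbor set and not on the order of the input, shuffling the points shuffles the output rows in the same way, which is the property that makes this kind of layer a natural fit for unordered point clouds.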
The authors' position encoding takes the relative position between the two points and encodes it with learnable linear layers.
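In the paper's notation, as far as I remember it, this reads as below, where p_i and p_j are the 3D coordinates of the two points. My recollection is that θ is a small MLP (two linear layers with a ReLU between them) rather than a single linear layer, so check the paper for the exact definition.

```latex
\delta = \theta\!\left( p_i - p_j \right)
```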
The overall network mainly follows the PointNet++ framework.
The SA layer is replaced by a transformer plus max pooling over the kNN neighbors for downsampling, and the upsampling looks essentially the same as the FP layer.
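As a rough illustration of the downsampling step, the sketch below picks a subset of center points and max-pools features over each center's kNN neighborhood. Using farthest point sampling to pick the centers is my assumption, carried over from PointNet++, and the per-point MLP that would normally come before the pooling is left out. The function names and toy data are hypothetical.

```python
import numpy as np

def farthest_point_sampling(xyz, m):
    """Greedily pick m indices so that the chosen points are spread out."""
    n = xyz.shape[0]
    chosen = np.zeros(m, dtype=np.int64)   # start from point 0
    min_d2 = np.full(n, np.inf)            # distance to nearest chosen point
    for i in range(1, m):
        d2 = np.sum((xyz - xyz[chosen[i - 1]]) ** 2, axis=1)
        min_d2 = np.minimum(min_d2, d2)
        chosen[i] = int(np.argmax(min_d2))
    return chosen

def knn_max_pool(xyz, feats, centres, k):
    """Max-pool features over the k nearest neighbours of each centre."""
    d2 = np.sum((xyz[centres][:, None] - xyz[None]) ** 2, axis=-1)  # (M, N)
    idx = np.argsort(d2, axis=1)[:, :k]                             # (M, k)
    return feats[idx].max(axis=1)                                   # (M, C)


# Toy example: two tight clusters of two points each.
xyz = np.array([[0.0, 0.0, 0.0], [0.1, 0.0, 0.0],
                [10.0, 0.0, 0.0], [10.1, 0.0, 0.0]])
feats = np.array([[1.0, 5.0], [2.0, 4.0], [7.0, 0.0], [3.0, 9.0]])
centres = farthest_point_sampling(xyz, m=2)
pooled = knn_max_pool(xyz, feats, centres, k=2)
print(centres, pooled)
```

Here the four input points are reduced to two, one per cluster, and each keeps the channel-wise maximum of its cluster's features.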
The ablation studies that follow compare:
the choice of the number of neighbors k
the effectiveness of the position encoding
the necessity of the attention module
Overall, the main innovation of the paper is the point transformer layer.
However, this layer is mainly usable indoors, where point clouds are dense. In autonomous driving scenes the point cloud is very sparse, so querying neighbors with kNN is unwise there, and the amount of computation is huge.
The position encoding scheme is worth learning from!
Copyright notice
This article was written by [Lukas88664]. Please include a link to the original when reposting. Thanks.
https://yzsam.com/2022/04/202204230407539040.html