[Paper reading] [3D object detection] Point Transformer
2022-04-23 04:36:00 【Lukas88664】
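Paper title: Point Transformer
ICCV 2021
The paper applies the transformer to point cloud tasks.
Because a point cloud is an unordered set of points, the transformer is a natural fit for this kind of data: self-attention does not depend on the order of its inputs.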

However, applying a transformer directly to a large-scale point cloud is clearly very expensive, because global attention compares every point with every other point. The authors therefore propose a new form of transformer processing: attention is computed only over the nearby points found with kNN.
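To make the saving concrete, here is a minimal sketch of a brute-force kNN query plus a count of attention pairs. The function name `knn_indices`, the toy coordinates, and the values n = 100,000 and k = 16 are illustrative assumptions, not taken from the post. A real implementation would use a spatial index or a GPU kernel instead of sorting all distances.

```python
import math

def knn_indices(points, query_idx, k):
    """Return the indices of the k points nearest to points[query_idx].

    Brute force: sorts all points by Euclidean distance to the query.
    The query point itself is included (distance 0), which is an assumption here.
    """
    q = points[query_idx]
    order = sorted(range(len(points)), key=lambda j: math.dist(q, points[j]))
    return order[:k]

# Toy point cloud (hypothetical coordinates).
points = [
    (0.0, 0.0, 0.0),
    (1.0, 0.0, 0.0),
    (0.0, 2.0, 0.0),
    (5.0, 5.0, 5.0),
    (0.1, 0.1, 0.0),
]
neighbors = knn_indices(points, query_idx=0, k=3)
print(neighbors)  # [0, 4, 1]

# Number of (query, key) pairs that attention has to evaluate.
n, k = 100_000, 16           # illustrative sizes only
global_pairs = n * n         # every point attends to every point
local_pairs = n * k          # every point attends to its k neighbors
print(global_pairs // local_pairs)  # 6250
```

With these illustrative sizes, local attention evaluates 6250 times fewer pairs than global attention, which is the motivation for restricting attention to a kNN neighborhood.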
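First, the authors review the transformer background. Self-attention operations can be divided into scalar attention and vector attention.
Scalar attention is the attention mechanism we usually talk about: the dot product between the query and key features gives one scalar weight per pair of tokens.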
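For reference, scalar dot-product attention can be written roughly as follows, in the paper's notation as I recall it. Here φ, ψ and α are pointwise feature transformations (linear projections, for example), δ is a position encoding, and ρ is a normalization such as softmax.

```latex
y_i = \sum_{x_j \in \mathcal{X}} \rho\left(\varphi(x_i)^{\top} \psi(x_j) + \delta\right) \alpha(x_j)
```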

Vector attention instead models the relation between two tokens with a vector of weights, so each feature channel can be modulated separately.

β is a relation function (subtraction, for example), and γ is a mapping function (an MLP, for example) that turns the relation into the attention vector.
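With β and γ as just described, vector attention can be written roughly as follows, in the paper's notation as I recall it. φ, ψ and α are pointwise feature transformations, δ is a position encoding, ρ is a normalization such as softmax, and ⊙ is the channel-wise (Hadamard) product.

```latex
y_i = \sum_{x_j \in \mathcal{X}} \rho\left(\gamma\left(\beta(\varphi(x_i), \psi(x_j)) + \delta\right)\right) \odot \alpha(x_j)
```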
The point transformer layer proposed by the authors is built mainly on the vector attention module, which models the relation between two tokens. In addition, the position encoding is added to the value branch as well, so the relation between position and value is also taken into account.
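Written out, the layer looks roughly like this, in the paper's notation as I recall it. X(i) is the set of the k nearest neighbors of point i, the relation function is subtraction, and the same position encoding δ appears in both the attention branch and the value branch.

```latex
y_i = \sum_{x_j \in \mathcal{X}(i)} \rho\left(\gamma\left(\varphi(x_i) - \psi(x_j) + \delta\right)\right) \odot \left(\alpha(x_j) + \delta\right)
```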


Concretely, given a center point and its neighboring points, the layer first encodes their features with linear layers and subtracts them to obtain their relation, then adds the position encoding. An MLP is applied to the result, and the outputs are normalized to give the weight matrix. The weights are multiplied with the sum of the value features and the position encoding, and aggregating over the k neighbors gives the output feature of the center point. Note that before the transformer is applied, a kNN query first selects the k points nearest to the center point, and the value weighting is done only over those k points.
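The steps above can be sketched for a single center point in plain Python. This is a toy illustration under stated assumptions, not the authors' implementation: the names `linear`, `mlp`, `rand_matrix` and `point_transformer_point` are hypothetical, the weights are random, biases are omitted, softmax is assumed as the normalization, and the neighbor indices are assumed to come from a separate kNN query.

```python
import math
import random

def linear(W, v):
    """Matrix-vector product; W is a list of rows."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]

def mlp(W1, W2, v):
    """Two linear layers with a ReLU in between (biases omitted for brevity)."""
    hidden = [max(0.0, h) for h in linear(W1, v)]
    return linear(W2, hidden)

def rand_matrix(rows, cols, rng):
    """Random weights standing in for learned parameters."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]

def point_transformer_point(i, neighbor_idx, feats, pos, params):
    """Output feature of center point i, attending over the given neighbors."""
    query = linear(params["phi"], feats[i])
    logits, values = [], []
    for j in neighbor_idx:
        # Position encoding from the relative position p_i - p_j.
        rel_pos = [a - b for a, b in zip(pos[i], pos[j])]
        delta = mlp(params["theta1"], params["theta2"], rel_pos)
        key = linear(params["psi"], feats[j])
        value = linear(params["alpha"], feats[j])
        # Relation by subtraction, plus position encoding, then the mapping MLP.
        relation = [q - k + d for q, k, d in zip(query, key, delta)]
        logits.append(mlp(params["gamma1"], params["gamma2"], relation))
        # The position encoding is also added to the value branch.
        values.append([v + d for v, d in zip(value, delta)])
    out = []
    for ch in range(len(query)):
        # Softmax over the neighbors, separately for each channel (vector attention).
        column = [row[ch] for row in logits]
        top = max(column)
        exps = [math.exp(x - top) for x in column]
        total = sum(exps)
        out.append(sum(e / total * val[ch] for e, val in zip(exps, values)))
    return out

rng = random.Random(0)
C = 4  # feature channels (toy size)
params = {name: rand_matrix(C, C, rng)
          for name in ("phi", "psi", "alpha", "gamma1", "gamma2", "theta2")}
params["theta1"] = rand_matrix(C, 3, rng)  # maps a 3D offset to C channels
pos = [[rng.uniform(0.0, 1.0) for _ in range(3)] for _ in range(6)]
feats = [[rng.uniform(-1.0, 1.0) for _ in range(C)] for _ in range(6)]

neighbor_idx = [0, 2, 5]  # assumed result of a kNN query around point 0
y0 = point_transformer_point(0, neighbor_idx, feats, pos, params)
print(len(y0))  # 4
```

Because the output is a weighted sum over the neighbor set, it does not depend on the order in which the neighbors are listed, which matches the unordered nature of point clouds.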
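The position encoding uses relative positions: the coordinate difference between the two points is passed through learnable layers (in the paper, an MLP with two linear layers and one ReLU).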
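In symbols, with p_i and p_j the 3D coordinates of the two points and θ the encoding function, the position encoding is roughly:

```latex
\delta = \theta(p_i - p_j)
```

Since θ is trained end to end with the rest of the network, the encoding is learned rather than hand-crafted.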

The overall architecture of the paper:

As the architecture shows, the network mainly follows the PointNet++ framework.
The SA (set abstraction) layer is replaced by a transformer, plus max pooling over each point's kNN neighborhood for downsampling, and the upsampling looks essentially the same as the FP (feature propagation) layer.
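A minimal sketch of the downsampling step, assuming a PointNet++-style pipeline: farthest point sampling picks the subset of points to keep, then each kept point max-pools the features of its k nearest input points. The names `farthest_point_sampling` and `knn_max_pool` are hypothetical, the use of farthest point sampling is an assumption carried over from PointNet++, and any per-point MLP applied before pooling is left out.

```python
import math

def farthest_point_sampling(points, m):
    """Greedily pick m indices; each new point is the one farthest from those already chosen."""
    chosen = [0]  # start from point 0 (arbitrary choice)
    min_dist = [math.dist(points[0], p) for p in points]
    while len(chosen) < m:
        nxt = max(range(len(points)), key=lambda j: min_dist[j])
        chosen.append(nxt)
        # Keep, for every point, its distance to the nearest chosen point.
        min_dist = [min(d, math.dist(points[nxt], p)) for d, p in zip(min_dist, points)]
    return chosen

def knn_max_pool(points, feats, centers, k):
    """For each sampled center, max-pool the features of its k nearest input points per channel."""
    pooled = []
    for c in centers:
        nearest = sorted(range(len(points)),
                         key=lambda j: math.dist(points[c], points[j]))[:k]
        pooled.append([max(feats[j][ch] for j in nearest)
                       for ch in range(len(feats[0]))])
    return pooled

# Toy data: two tight clusters along the x axis (hypothetical values).
points = [(0.0, 0.0, 0.0), (0.1, 0.0, 0.0), (4.0, 0.0, 0.0), (4.1, 0.0, 0.0)]
feats = [[1.0, 5.0], [2.0, 3.0], [7.0, 0.0], [6.0, 9.0]]

centers = farthest_point_sampling(points, m=2)
pooled = knn_max_pool(points, feats, centers, k=2)
print(centers)  # [0, 3]
print(pooled)   # [[2.0, 5.0], [7.0, 9.0]]
```

Each cluster of two points collapses to one point whose feature is the channel-wise maximum over the cluster, so the cardinality is halved while the strongest local responses are kept.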

The ablation studies that follow compare the choice of the number of neighbors k


Effectiveness of the position encoding

Necessity of the attention module
Overall, the main contribution of the paper is the point transformer layer.
However, this layer is mainly suited to indoor scenes with dense point clouds. In some autonomous driving scenes the point cloud is very sparse; using kNN to query neighboring points there is unwise, and the amount of computation is huge.
The position-encoding approach is worth learning from!
Copyright notice
This article was written by [Lukas88664]. Please include a link to the original when reposting. Thanks.
https://yzsam.com/2022/04/202204230407539040.html