当前位置:网站首页>Improving 3D object detection with channel wise transformer
Improving 3D object detection with channel wise transformer
2022-04-23 04:37:00 【Lukas88664】
title :Improving 3D Object Detection with Channel-wise Transformer
iccv2021
Zhejiang University Aridamo house The famous CT3d
The article is on the point transformer, Previous articles have said that on a huge point cloud tranformer The amount of computation is enormous So this article is a little clever utilize transformer Design refinement net
Article based on utilization The characteristics of the original point are sufficient to box refinement Based on the assumption of
Old rules Upper figure
You can see trans It is mainly used in the stage of encoding and decoding
Firstly, the author uses the output of single-stage network proposals Make a size expansion of them The concrete operation is to proposal Expand into a large cylinder The radius of the cylinder is proposal The diagonal of The height does not limit the size In such a cylinder We randomly choose 256 individual point Conduct transformer
transformer There is no innovation in the structure of The main reason is that the author combines location coding with feature A deeper fusion Instead of working before pe Simple addition of .
And then 256 Superposition of points obtain NXD The characteristic dimension of
The following is the focus of the full text :
decode link
Compared with the traditional decoder m individual query embedding Different We only have one here Because we only need to restore one proposal
Standard coding :
vector qK Each value of can be regarded as a global aggregation of a single point ( That is, each key is embedded ), Follow up softmax The function assigns a decoded value to each point according to the probability in the normalized vector . therefore , The value of decoding weight vector is a simple global aggregation , Lack of local channel modeling , This is necessary to understand the three-dimensional surface structure of point cloud , Because different channels usually show a strong geometric relationship in the point cloud .
channel code
and softmax The calculated decoding weight is associated with each channel , Ignore the global aggregation of each point . therefore , We can come to a conclusion , The standard decoding scheme focuses on global aggregation , The channel weighting scheme focuses on channel local aggregation . In order to combine their characteristics , We propose an extended channel weighting scheme , As shown below .
So we propose to combine the two schemes Mainly for The output of the previous module Conduct repeatD Time Then with Kh Multiply result softmax Then linear mapping
As a result, it can be compared with point Multiply
Finally, we analyze each dimension reweight Handle Output 1XN Of feature The result is convenient value Weighted Get our final output
Send the final output to two different ffn Test in iou and regression.
experiment :
ablation The influence of different pre single-stage detectors :
pe Advantages of modules
And different decoding The way is good or bad
In fact, it has not been greatly improved ...
The main reason I think is for proposal Come on Global features are enough
版权声明
本文为[Lukas88664]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/04/202204230407538968.html
边栏推荐
- Express middleware ① (use of Middleware)
- 阿里十年技术专家联合打造“最新”Jetpack Compose项目实战演练(附Demo)
- 从MySQL数据库迁移到AWS DynamoDB
- Use recyclerview to realize left-right side-by-side classification selection
- win10, mysql-8.0.26-winx64.zip 安装
- 2020 is coming to an end, special and unforgettable.
- Bridge between ischemic stroke and intestinal flora: short chain fatty acids
- A heavy sword without a blade is a great skill
- Redis command Encyclopedia
- zynq平台交叉编译器的安装
猜你喜欢
Interaction of diet gut microbiota on cardiovascular disease
matlab读取多张fig图然后合并为一张图(子图的形式)
Thought of reducing Governance -- detailed summary of binary search
Inverse system of RC low pass filter
针对NFT的网络钓鱼
C语言常用字符串处理函数
zynq平台交叉编译器的安装
AWS EKS添加集群用户或IAM角色
Stm32f4 MCU ADC sampling and FFT of ARM-DSP Library
基于英飞凌MCU GTM模块的无刷电机驱动方案开源啦
随机推荐
Express middleware ① (use of Middleware)
RC低通滤波器的逆系统
SQL statement for adding columns in MySQL table
Installation and use of Apache bench (AB pressure test tool)
補:注解(Annotation)
zynq平臺交叉編譯器的安裝
TreeSet after class exercises
shell wc (统计字符数量)的基本使用
HMS Core Discovery第14期回顾长文|纵享丝滑剪辑,释放视频创作力
Express middleware ② (classification of Middleware)
Express中间件②(中间件的分类)
为什么推荐你学嵌入式
C语言: 指针的进阶
Detailed explanation of life cycle component of jetpack
补:注解(Annotation)
记录一下盲注脚本
A new method for evaluating the quality of metagenome assembly - magista
Eksctl deploying AWS eks
阿里云IoT流转到postgresql数据库方案
Single chip microcomputer serial port data processing (2) -- ucosiii + cyclic queue receiving data