Improving 3D Object Detection with Channel-wise Transformer
2022-04-23 04:37:00 【Lukas88664】
Paper title: Improving 3D Object Detection with Channel-wise Transformer
Venue: ICCV 2021
From Zhejiang University and Alibaba DAMO Academy: the well-known CT3D.
This paper applies a transformer to points. Earlier articles have noted that running a transformer over a huge point cloud is enormously expensive, so this paper takes a clever shortcut and uses the transformer only to design the refinement network.
The paper rests on the assumption that the features of the raw points are sufficient for box refinement.
As usual, start with the overview figure from the paper.
It shows that the transformer is mainly used in the encoding and decoding stages.
First, the author takes the proposals output by a single-stage network and enlarges them. Concretely, each proposal is expanded into a large cylinder whose radius is the diagonal of the proposal and whose height is unlimited. Within this cylinder, 256 points are randomly sampled and passed to the transformer.
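The sampling step above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' code: the function name `sample_points_in_cylinder` is hypothetical, the radius is taken as the full bird's-eye-view diagonal of the box, and the handling of proposals with fewer than 256 points (sampling with replacement, or zeros if the cylinder is empty) is my own choice.

```python
import numpy as np

def sample_points_in_cylinder(points, center_xy, length, width,
                              num_samples=256, rng=None):
    """Randomly pick `num_samples` points inside a vertical cylinder.

    points: (M, C) array whose first three columns are x, y, z.
    The height is unlimited, so only the x-y distance to the box centre matters.
    """
    rng = np.random.default_rng() if rng is None else rng
    # Assumption: radius = bird's-eye-view diagonal of the proposal.
    radius = np.hypot(length, width)
    dist_xy = np.linalg.norm(points[:, :2] - np.asarray(center_xy), axis=1)
    inside = np.flatnonzero(dist_xy <= radius)
    if inside.size == 0:
        # Assumption: an empty cylinder yields all-zero placeholder points.
        return np.zeros((num_samples, points.shape[1]), dtype=points.dtype)
    # Sample with replacement only when the cylinder holds too few points.
    replace = inside.size < num_samples
    chosen = rng.choice(inside, size=num_samples, replace=replace)
    return points[chosen]
```

Because the cylinder ignores z, points above and below the proposal are kept, which gives the refinement stage extra context around the box.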
There is no innovation in the transformer structure itself. The main change is that the author fuses the positional encoding with the features more deeply, instead of simply adding the PE as earlier work does.
The 256 points are then stacked to obtain features of dimension N×D.
The following is the focus of the whole paper:
The decoding stage
Unlike a traditional decoder, which uses M query embeddings, there is only one query here, because only one proposal needs to be recovered.
Standard decoding:
Each value of the vector qK can be regarded as a global aggregation for a single point (that is, for each key embedding). The subsequent softmax function then assigns a decoding value to each point according to its probability in the normalized vector. The values of the decoding weight vector therefore come from a simple global aggregation and lack local channel-wise modeling. Such modeling is necessary for understanding the 3D surface structure of a point cloud, because different channels usually show strong geometric relationships in point clouds.
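As a rough illustration of the standard decoding described above, here is a minimal single-head NumPy sketch. The names `standard_decode`, `q`, `K`, and `V` are placeholders of mine, and the scaling by the square root of the channel dimension is the usual scaled dot-product convention, assumed here rather than taken from the text.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax along `axis`."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def standard_decode(q, K, V):
    """Single-query attention.

    q: (1, D) query embedding; K, V: (N, D) key and value embeddings.
    Returns the decoded feature (1, D) and the decoding weights (1, N).
    """
    d = K.shape[1]
    # One scalar per point: all D channels are summed into a single score.
    scores = q @ K.T / np.sqrt(d)          # (1, N)
    weights = softmax(scores, axis=-1)     # normalised over the N points
    return weights @ V, weights
```

The dot product collapses the channel dimension before the softmax, which is why each point receives one global weight and no channel-specific information survives.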
Channel-wise decoding:
Here the decoding weights computed by softmax are associated with each channel, and so ignore the global aggregation of each point. We can therefore conclude that the standard decoding scheme focuses on global aggregation, while the channel-wise re-weighting scheme focuses on local aggregation within each channel. To combine the strengths of both, the paper proposes an extended channel-wise re-weighting scheme, described next.
The two schemes are combined as follows: the output of the previous step (the qK vector) is repeated D times, multiplied with K_h, passed through softmax, and then linearly mapped.
The result can then be multiplied with the point features.
In other words, each channel dimension is re-weighted to produce a 1×N feature, which is used to weight the values and give the final output.
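One way to read the combined scheme above is the single-head NumPy sketch below. It is an interpretation under assumptions, not the reference implementation: `extended_channelwise_decode` is a hypothetical name, `s` stands in for the learned linear mapping as a plain (1, D) array, the softmax is taken over the N points for each channel, and the square-root scaling is assumed.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax along `axis`."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def extended_channelwise_decode(q, K, V, s):
    """Extended channel-wise re-weighting with a single query.

    q: (1, D) query; K, V: (N, D) keys and values;
    s: (1, D) stand-in for the learned linear mapping (assumption).
    Returns the decoded feature (1, D) and the decoding weights (1, N).
    """
    d = K.shape[1]
    point_scores = q @ K.T                           # (1, N) global score per point
    repeated = np.repeat(point_scores, d, axis=0)    # (D, N) repeated D times
    channel_scores = repeated * K.T / np.sqrt(d)     # element-wise product with keys
    channel_weights = softmax(channel_scores, axis=1)  # softmax over points, per channel
    weights = s @ channel_weights                    # (1, D) @ (D, N) -> (1, N)
    return weights @ V, weights
```

With `s` set to a uniform average over channels the resulting 1×N weights still sum to one, so the output stays a convex combination of the values, while each channel contributes its own view of which points matter.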
The final output is sent to two separate FFNs, one for IoU prediction and one for box regression.
Experiments:
Ablation on the influence of different single-stage detectors used as the first stage:
Benefit of the PE module:
Comparison of the different decoding schemes:
In fact, the improvement is not large...
I think the main reason is that, for a proposal, global features are already enough.
Copyright notice
This article was written by [Lukas88664]. Please include a link to the original when reposting. Thanks.
https://yzsam.com/2022/04/202204230407538968.html