当前位置:网站首页>CVPR 2022 | QueryDet:使用级联稀疏query加速高分辨率下的小目标检测
CVPR 2022 | QueryDet:使用级联稀疏query加速高分辨率下的小目标检测
2022-04-23 20:04:00 【智源社区】
虽然在过去的几年中,基于深度学习的通用目标检测已经取得了巨大的成功,但在检测小目标的性能和效率方面却远远不能令人满意。推广小目标检测最常见和有效的方法是使用高分辨率图像或特征图。然而,这两种方法都会导致昂贵的计算,因为计算成本会随着图像和特征大小的增加而增加。
我们提出了 QueryDet,它使用一种新颖的查询机制来加快基于特征金字塔的目标检测器的推断速度。该 pipeline 由两个步骤组成:首先在低分辨率特征上预测小目标的粗定位,然后利用这些粗位置稀疏引导的高分辨率特征计算出准确的检测结果。这样既可以获得高分辨率 feature map 的 benefit,又可以避免对背景区域使用较少的计算量。
在 popular COCO 数据集上,该方法将 mAP 提高了 1.0,mAP-small 提高了2.0,将高分辨率的推理速度平均提高到 3.0×。在包含更多小对象的 VisDrone 数据集上,我们获取了新的 SOTA,同时获得了平均 2.3× 高分辨率的加速。
论文标题:QueryDet: Cascaded Sparse Query for Accelerating High-Resolution for Small Object Detection
论文链接:https://arxiv.org/abs/2103.09136
代码链接:https://github.com/ChenhongyiYang/QueryDet-PyTorch

版权声明
本文为[智源社区]所创,转载请带上原文链接,感谢
https://hub.baai.ac.cn/views/16620
边栏推荐
- Mfcc: Mel frequency cepstrum coefficient calculation of perceived frequency and actual frequency conversion
- Deep learning -- Summary of Feature Engineering
- Vericrypt file hard disk encryption tutorial
- Comment créer un pass BEP - 20 sur la chaîne BNB
- Unity general steps for creating a hyper realistic 3D scene
- [H264] hevc H264 parsing and frame rate setting of the old version of libvlc
- Speex Wiener filter and rewriting of hypergeometric distribution
- Devops integration - environment variables and building tools of Jenkins service
- 【文本分类案例】(4) RNN、LSTM 电影评价倾向分类,附TensorFlow完整代码
- JVM的类加载过程
猜你喜欢

Garbage collector and memory allocation strategy

NiO related Basics

【webrtc】Add x264 encoder for CEF/Chromium

【文本分类案例】(4) RNN、LSTM 电影评价倾向分类,附TensorFlow完整代码

LeetCode异或运算

JVM的类加载过程

Project training of Software College of Shandong University - Innovation Training - network security shooting range experimental platform (6)

Project training of Software College of Shandong University - Innovation Training - network security shooting range experimental platform (V)

Lottery applet, mother no longer have to worry about who does the dishes (assign tasks), so easy

【数值预测案例】(3) LSTM 时间序列电量预测,附Tensorflow完整代码
随机推荐
uIP1. 0 actively sent problem understanding
antd dropdown + modal + textarea导致的textarea光标不可被键盘控制问题
Deep learning -- Summary of Feature Engineering
Openharmony open source developer growth plan, looking for new open source forces that change the world!
Understanding various team patterns in scrum patterns
Data analysis learning directory
Virtual machine performance monitoring and fault handling tools
RuntimeError: Providing a bool or integral fill value without setting the optional `dtype` or `out`
Openharmony open source developer growth plan, looking for new open source forces that change the world!
Zero base to build profit taking away CPS platform official account
Mysql database - connection query
山东大学软件学院项目实训-创新实训-网络安全靶场实验平台(七)
Zero cost, zero foundation, build profitable film and television applet
The difference between underline and dot of golang import package
Shanda Wangan shooting range experimental platform project - personal record (V)
MySQL advanced lock - overview of MySQL locks and classification of MySQL locks: global lock (data backup), table level lock (table shared read lock, table exclusive write lock, metadata lock and inte
Golang timer
【h264】libvlc 老版本的 hevc h264 解析,帧率设定
Is meituan, a profit-making company with zero foundation, hungry? Coupon CPS applet (with source code)
程序设计语言基础(2)