当前位置:网站首页>[thesis study] Yolo V2
[thesis study] Yolo V2
2022-04-21 12:38:00 【Coke Daniel】
1、 Original paper
link , Extraction code :mk96
2、 The thought of thesis
3、 Thesis translation
YOLO 9000 Better 、 faster 、 stronger
Abstract
We have introduced an advanced 、 In real time 、 Can detect more than 9000 Type of target detection system -YOLO 9000. First, we put forward some suggestions for YOLO The improvement of this detection algorithm , There are some new things in these improvements , We also refer to some previous work . The improved model YOLO v2 stay Pascal voc and COCO The performance of these standard testing tasks is the best . A novel multi-scale training method is used to make the same YOLO v2 The model can run on different scales , And a good balance between speed and accuracy . With 67fps The detection speed of ,YOLO v2 stay voc Obtained on dataset 76.8 Of mAP. With 40fps The detection speed of ,YOLO v2 To obtain the 78.6 Of mAP, Than some advanced algorithms, such as resnet Of RCNN and SSD Good performance , And significantly faster . Finally, we propose a method of joint training target detection and classification . In this way , We are at the same time COCO Target detection and ImageNet Simultaneous training on image classification data set YOLO v2. This kind of joint training makes YOLO v2 It can detect the categories of detection data that are not marked . We are ImageNet Verify our method in the detection task . Even though 200 There are only 44 Detection data of categories ,YOLO v2 stay ImageNet The detection verification set obtained 19.7 Of mAP. In not belonging to COCO Of 156 Among categories ,YOLO v2 To obtain the 16 Of mAP. however ,YOLO The categories that can be detected are not just 200 Kind of , It can detect more than 9000 Different target categories . also , He is still real-time detection .
1、 introduction
A general purpose target detection system should be fast 、 Accurate and able to recognize a large number of objects . Since the introduction of neural networks , The detection system becomes more and more fast and accurate . however , A large number of detection methods can still detect only a small number of objects .
Compared with classified or data sets , The current target detection data set is limited . The most common detection data sets contain thousands to hundreds of thousands of images with hundreds of labels . The classified dataset has millions of images in tens of thousands or hundreds of thousands of categories .
We hope the detection can reach the classification level . however , Detect the picture annotation in the task , Far more precious than sorting or labeling ( Tags are usually provided free of charge by users ). So in the near future , We are unlikely to see the detection data set reach the size of the classification data set .
We propose a new method to use the large amount of classified data we already have , And use it to expand the detection range of our current target detection system . Our method uses a hierarchical view of target classification , It allows us to combine different data sets .
We also propose a joint training algorithm , This algorithm enables us to train our target detector on the detected and classified data . This algorithm uses the labeled detection images to learn to locate objects accurately , At the same time, the classification image is used to improve his vocabulary and robustness .
Using this method, we trained YOLOv2 This is real-time , Can detect more than 9000 A target detection system for different types of objects . First of all we have YOLO Improved on the basis of YOLO v2 This advanced real-time detector . Then we use data set merging method and joint training algorithm , stay ImagNet Of 9000 Categories and COCO The model is trained on the detection data set .

picture 1:YOLO v2 It can detect a large number of object categories in real time .
2、 Better
YOLO Compared with the advanced target detection system, there are a lot of problems .YOLO And Fast RCNN The error comparison analysis shows that YOLO There are a lot of positioning errors .YOLO And based on region proposal Compared with ,recall Relatively low . So we want to maintain the classification accuracy , Focus on improving recall And precise positioning .
Computer vision
版权声明
本文为[Coke Daniel]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/04/202204211229227186.html
边栏推荐
- What's the difference between you and the architect? How to improve
- 2020batjz Android Senior Engineer Interview Questions - multiple choice questions collection (with answer analysis)
- 2022年监理工程师考试质量、投资、进度控制练习题及答案
- Kubernetes service discovery monitoring apiserver
- 顶流“健身博主”刘畊宏
- 字段行相同则合并在另外一个字段sql 语句?
- Discussion on content and cost of elastic Architecture - Digital Architecture Design (3)
- 2022年一级注册建筑师考试建筑物理与设备复习题及答案
- 第四章 SQL查询之-层次化查询
- Reverse crawler 30 verification code of a fourth generation slider
猜你喜欢

How does electron net send a post request
![[linked list] 148 Sort linked list](/img/1a/7f0c00af9d14ac0f7f011c4b4641a1.png)
[linked list] 148 Sort linked list

Study notes of "deep learning" (V)

PSI open source library based on homomorphic encryption-1

Three. JS learning project -- 3D data visualization of anti US aggression and aid Korea

2022年一级注册建筑师考试建筑物理与设备复习题及答案

终于有人讲明白了!原来这才是全球低时延一张网技术

Binary tree traversal series 02 Morris traversal

Title record of TIANTI competition (II)

2022年监理工程师考试基本理论与相关法规练习题及答案
随机推荐
Pytroch deep learning run cifar10 data set
PSI open source library based on homomorphic encryption-1
第四章 SQL查询之-层次化查询
2022年初级会计职称考试经济法基础练习题及答案
A tag or < router link > use target = "_blank" to open a new page, and the sessionstorage of the new page is lost
高并发IO的底层原理
IPEmotion采集J1939协议信号
AcWing 1854. 晋升计数 题解 解方程推式子
Go语言 反射
Anchor-free新网络:CenterNet携手CornerNet形成了CenterNet++
2022年监理工程师合同管理练习题及答案
业内视频超分辨率新标杆,快手&大连理工研究登上CVPR 2022
[software test series IX] description of matters to be provided for pressure test application
[Software Test Series VII] software test plan
Aicoco AI frontier promotion (4.21)
R语言中实现随机森林建模的包randomForest
[software test series 8] software project test report
一文了解全面静态代码分析
打包的jar能正常运行,正常连接数据库。exe4j生成exe后执行到访问数据库就报错:
Laugh to death! The programmer's life extension guide is open source