当前位置:网站首页>The difference between classification, object detection, semantic segmentation, and instance segmentation
The difference between classification, object detection, semantic segmentation, and instance segmentation
2022-08-08 04:30:00 【expect 686】
There are many tasks in computer vision, including image classification, object detection, semantic segmentation, instance segmentation and panorama segmentation, etc. What is the difference between them?
1, Image Classification
Image classification (left in the picture below) is to determine the classification of the image, for example, in the learning classification, the data set is a person (person), a sheep (sheep),There are four kinds of dog (dog) and cat (cat). Image classification requires which categories are contained in a given image output image. For example, the example in the figure below contains three types of person, sheep and dog.

2, Object detection
target detection (above right) is simply what's in the picture?Where are they?(frame them in rectangles)
Currently commonly used target detection algorithms include Faster R-CNN and YOLO-based target detection algorithms
3, semantic segmentation
Target segmentation in the usual sense refers to semantic segmentation
Semantic segmentation (left in the picture below) is to distinguish every pixel in the picture, not just a rectangular frame.But different instances of the same object do not need to be segmented separately.On the left side of the picture below, label it as people, sheep, dogs, and grass.Instead of Sheep 1, Sheep 2, Sheep 3, Sheep 4, Sheep 5, etc.
4, Instance segmentation
Instance segmentation (right above) is actually a combination of **target detection and semantic segmentation**.Relative to the bounding box of target detection, instance segmentation can be accurate to the edge of the object; relative to semantic segmentation, instance segmentation needs to label different individuals of the same object on the map (sheep 1, sheep 2, sheep 3...)
The most commonly used instance segmentation algorithm is Mask R-CNN.
Mask R-CNN performs pixel-level segmentation by adding a branch to Faster R-CNN that outputs a binary mask indicating whether a given pixel is part of the target object: this branch is based on convolutionalFully Convolutional Network for Neural Network Feature Mapping.Taking a given convolutional neural network feature map as input, the output is a matrix where all positions where a pixel belongs to the object is represented by 1 and other positions are represented by 0, which is the binary mask.
Once these masks are generated, Mask R-CNN combines RoIAlign with classification and bounding boxes from Faster R-CNN for accurate segmentation:
5, Panoramic segmentation
Panoramic segmentationis a combination of semantic segmentation and instance segmentation.Different from instance segmentation: instance segmentation only detects objects in the image and segments the detected objects, while panorama segmentation is to detect and segment all objects in the image, including the background.
边栏推荐
- 机器学习笔记:学习率预热 warmup
- 语音鉴定软件
- NetCore使用Dapper查询数据
- 07查询表达式 及 page分页、order 排序《ThinkPHP6 入门到电商实战》
- leetcode 112.路经总和 递归
- 【保研面试】英文问题
- Heterogeneous on the Graph paper to share 】 【 small sample learning: HG - Meta: Graph Meta - learning over Heterogeneous Graphs
- awk语法-03-awk表达式(if语句、while循环、for循环)、awk中执行shell命令
- Building a High-Performance Platform on AWS Using Presto and Alluxio to Support Real-Time Gaming Services
- ES6解构赋值的使用说明
猜你喜欢

package package

32. Do you know how Redis strings are implemented?

This article will give you a thorough understanding of synchronized and Lock

Shell 脚本 — 多行注释、开启子/不开启子进程执行、转义带颜色输出、读取键盘输入、输入输出重定向、单双引号、命令替换、读取变量、系统变量、正则过滤、算术运算、一行多条命令、字符串比较

项目分析(嵌入式产品Web化)

leetcode: 122. 买卖股票的最佳时机 II

【冷启动】快手《POSO: Personalized Cold Start Modules for Large-scale Recommender Systems》

数据库篇复习篇

ES6解构赋值的使用说明

【多任务模型】《Multi-Faceted Hierarchical Multi-Task Learning for a Large Number of Tasks with Multi-dimens
随机推荐
2022-08-07 mysql/stonedb慢SQL-子查询-半连接
Heterogeneous on the Graph paper to share 】 【 small sample learning: HG - Meta: Graph Meta - learning over Heterogeneous Graphs
ES6解构赋值的使用说明
Some excellent blog recommendations for Qt event learning reference
MYSQL导出数据字典
MySQL4(多表查询)
使用 Presto 和 Alluxio 在 AWS 上搭建高性能平台来支持实时游戏服务
L3-007 天梯地图(测试点2卡住了可以看下)
一行代码统计文本中指定字符串出现的次数
【论文分享】异质图上的小样本学习:HG-Meta: Graph Meta-learning over Heterogeneous Graphs
如何在推荐系统中玩转知识图谱
基于MindSpore框架的数字调制信号盲识别研究
牛客多校第6场赛后学习 B(两种做法)G(两种做法)M(两种写法)J
Young freshmen who yearn for open source | The guide to avoiding pits from open source to employment is here!
Open3D 基于颜色的ICP配准
CARLA 笔记(05)— Actors and blueprints(创建和修改 Blueprint、生成 Spawning、使用 Handling、销毁 Destruction)
【图基础】如何定义异质图上的小样本学习:Heterogeneous Graph Few-Shot Learning
32. 你知道Redis的字符串是怎么实现的吗?
MindFusion.WPF Pack 2022.R1
LeetCode_485_Maximum number of consecutive 1s