当前位置:网站首页>[CV-Learning] Semantic Segmentation
[CV-Learning] Semantic Segmentation
2022-08-04 06:06:00 【Xiao Liang has to work hard】
Semantic segmentation
Definition: Assign a class label to each pixel of an image, regardless of instance, only consider pixel class.
Old ideas (sliding window)

Ideas: Take a certain pixel as the center, select an area, and classify the area.
Problem: The efficiency is too low, and the overlapping area features are calculated repeatedly.
New idea (full convolution)

Idea: Perform pooling during Convolutions to keep the image size and depth unchanged, and then obtain Scores with a depth of C, where each depth represents a category, and each pixel can be judged by category.Finally, by judging the sum of the cross-entropy loss of each pixel, the total loss is controlled by feedback, and the lower the better.
Problem: The original resolution of the image is maintained during the Convolutions process, which requires a huge amount of video memory.Therefore, the Convolutions process can be optimized to improve performance.
Performance improvements

Ideas: In the process of Convolutions, first down-sampling is performed to extract high-level semantic features, and then the original learning is returned through up-sampling to establish a high-level semantic-to-classmark mapping.
Downsampling
pooling, strided convolution
Upsample t
unpooling or strided transpose convolution
Hard-coded upsampling
Learnable Upsampling (Transpose convolution)
Downsampling
Upsampling
One-dimensional upsampling example
Filter is a filter kernel, which is a matrix that can be learned.
Convolution and matrix multiplication (one-dimensional)
The following legends are down-sampling first, and then up-sampling restoration.
Example 1: Step 1
Example2: Step size is 2
UNET
Upsampling is obtained based on the high-level semantics obtained by downsampling, but sometimes the high-level semantics do not work well, and low-level semantics need to be used.
In response to this problem, UNET is proposed, which integrates the low-level semantics in the downsampling process into the upsampling process, resulting in better results.
Integration ideas:
1. Copy the red feature channel on the left and combine it with the upsampled feature channel.
2. After the red feature channel on the left is processed by convolution, it is combined with the up-sampled feature channel.
边栏推荐
猜你喜欢

TensorFlow2学习笔记:4、第一个神经网模型,鸢尾花分类

逻辑回归---简介、API简介、案例:癌症分类预测、分类评估法以及ROC曲线和AUC指标

安装dlib踩坑记录,报错:WARNING: pip is configured with locations that require TLS/SSL

【CV-Learning】图像分类

NFT市场可二开开源系统

fill_between in Matplotlib; np.argsort() function

flink on yarn指定第三方jar包

TensorFlow2 study notes: 7. Optimizer

MySQL事务详解(事务隔离级别、实现、MVCC、幻读问题)

【CV-Learning】目标检测&实例分割
随机推荐
[Deep Learning 21-Day Learning Challenge] 3. Use a self-made dataset - Convolutional Neural Network (CNN) Weather Recognition
WARNING: sql version 9.2, server version 11.0.Some psql features might not work.
NFT市场开源系统
RecyclerView的用法
SQL练习 2022/7/3
SQL练习 2022/6/30
智能合约安全——delegatecall (2)
多项式回归(PolynomialFeatures)
数据库根据提纲复习
剑指 Offer 2022/7/3
SQL练习 2022/7/1
oracle的number与postgresql的numeric对比
(TensorFlow)——tf.variable_scope和tf.name_scope详解
Usage of Thread, Handler and IntentService
postgres 递归查询
线性回归简介01---API使用案例
flink-sql所有数据类型
彻底搞懂箱形图分析
Jupyter Notebook installed library;ModuleNotFoundError: No module named 'plotly' solution.
【树 图 科 技 头 条】2022年6月27日 星期一 今年ETH2.0无望








