当前位置:网站首页>Mobile/Embedded-CV Model-2017: MobelNets-v1
Mobile/Embedded-CV Model-2017: MobelNets-v1
2022-08-08 09:32:00 【u013250861】
The MobileNet model is a lightweight deep neural network proposed by Google for embedded devices such as mobile phones. The core idea of its use is depthwise separable convolution (depth separable convolution).
I. What is depthwise separable convolution (proposed by Mobilenet-v1)
Assume a network convolutional layer with a convolution kernel size of 3×3, 16 input channels and 32 output channels;
The conventional convolution operation is to apply 32 3×3×16 convolution kernels to the 16-channel input image, then according to the convolution layer parameter calculation formula, convolution calculation + convolution parameter amount + convolution measurementQuantity
The required parameters are 32*(3316+1)=4640.
If 16 convolution kernels (331) with a size of 3×3 are applied to the input image of 16 channels first, 16 feature maps are obtained, and the fusion operation is performed.Before, then use 32 convolution kernels (1116) with a size of 1×1 to traverse the 16 feature maps obtained above. According to the calculation formula of the parameters of the convolution layer, the required parameters are (33116+16) + (111632+32) = 706.
The above is the role of the depthwise separable convolution. In layman's terms, the feature extraction and feature combination of the ordinary convolutional layer are completed and output at one time, while the depthwise separable convolution first uses 33 with a thickness of 1.The convolution kernel (depthwise layered convolution) is used, and the number of channels is adjusted with a 11 convolution kernel (pointwise convolution), and feature extraction and feature combination are performed separately.
It can be seen that the depthwise separable convolution can greatly reduce the parameters of the model, and its specific structure is as follows (the left is the ordinary convolutional layer structure, the right is the depthwise separable convolution structure):
- Only one convolution kernel with dimension in_channels is used for feature extraction (no feature combination) when performing deepthwise (DW) convolution;
- When performing pointwise (PW) convolution, only output_channels convolution kernels with dimension in_channels 1*1 are used for feature combination.
References:
Lightweight Network - MobileNet
The application of deep learning in image processing (tensorflow2.4 and pytorch1.10 implementation)
Lightweight Network-Mobilenet Series(v1,v2,v3)
边栏推荐
猜你喜欢
随机推荐
VPP静态映射实现DNAT
你一定要看的安装及卸载测试用例的步骤及方法总结
ES8 | async and await
LVS负载均衡群集
Multi-scalar multiplication: state of the art & new ideas
什么是本质安全?
ACWing 198. Antiprime Problem Solution
使用分类权重,轻松解决数据不平衡的问题
Go 函数与方法
Web optimization experience (6) - optimization closed-loop practice
22-08-06 Xi'an EasyExcel implements dictionary table import and export
巧用Prometheus来扩展kubernetes调度器
SeeOD应用:He-Ne激光束聚焦物镜设计
让百姓消费更安全更放心更满意 江苏出台放心消费创建示范认定管理办法
LAN技术-5Eth-Trunk
DVWA full level detailed customs clearance tutorial
Recommended download software
数学基础(一)矩阵对角化、SVD分解以及应用
HyperLynx(三)传输线类型及相关设置
安装oracle19c时报错DBT-50000