Advantages, disadvantages, and selection of activation functions
2022-04-23 15:27:00 【moletop】
Activation functions:
- Significance: an activation function gives the network its nonlinear modeling ability. Without one, the network can only express a linear mapping; no matter how many hidden layers are stacked, the whole network is still equivalent to a single-layer network.
- Required characteristics: 1. Continuous and differentiable. 2. As simple as possible, to keep network computation efficient. 3. An output range that is neither too large nor too small; otherwise training efficiency and stability suffer.
- Saturating activation functions: Sigmoid, Tanh. Non-saturating activation function: ReLU. In addition, the output layer (classifier) uses softmax.
- Choosing an activation function: in hidden layers, ReLU > Tanh > Sigmoid. In RNNs: Tanh, Sigmoid. In the output layer: softmax (for classification tasks). If neuron death occurs, PReLU can be used.
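The claim that a network without activation functions collapses to a single layer can be checked numerically. The following is a minimal sketch, not taken from the original article: the weight matrices `W1` and `W2` and the input `x` are made-up values, biases are omitted for brevity, and the helper names are hypothetical.

```python
# Sketch: two stacked linear layers with no activation equal one linear layer.
# W1, W2 and x are illustrative values, not from the original article.

def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def matmul(A, B):
    """Multiply matrix A by matrix B (both lists of rows)."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def relu(v):
    """Apply ReLU element-wise."""
    return [max(0.0, t) for t in v]

W1 = [[1.0, -2.0], [0.5, 1.0]]   # hypothetical first-layer weights
W2 = [[2.0, 1.0], [-1.0, 3.0]]   # hypothetical second-layer weights
x = [1.0, 2.0]

# Two linear layers applied in sequence, with no activation in between.
two_layer_linear = matvec(W2, matvec(W1, x))

# One linear layer whose weight matrix is the product W2 @ W1.
single_layer = matvec(matmul(W2, W1), x)

# The same two layers with a ReLU in between.
with_relu = matvec(W2, relu(matvec(W1, x)))

print(two_layer_linear, single_layer, with_relu)
```

With these values the two linear variants produce the same vector, while the ReLU variant does not, because the nonlinearity prevents the two weight matrices from being merged into one.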
1. Sigmoid function
Advantages:
<1> Sigmoid's output range is (0, 1), which matches the range of a probability, and the function is monotonically increasing, which makes it easier to optimize.
<2> Sigmoid's derivative is easy to compute; it can be written directly in terms of the function's own output: sigmoid'(x) = sigmoid(x) * (1 - sigmoid(x)).
Disadvantages:
<1> Networks that use Sigmoid converge slowly.
<2> Sigmoid is soft-saturating, so its gradient easily vanishes. This makes it unsuitable for training deep networks.
<3> The Sigmoid function is not centered at (0, 0), which disrupts the data distribution.
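The saturation and vanishing-gradient points above can be made concrete with a few numbers. This is a minimal sketch using only the standard library; the depth of 10 layers is an arbitrary illustrative choice, and the product of per-layer derivatives ignores the weight terms that a real backward pass would also include.

```python
import math

def sigmoid(x):
    """Numerically stable logistic sigmoid."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)

def sigmoid_grad(x):
    """Derivative of sigmoid, expressed through its own output."""
    s = sigmoid(x)
    return s * (1.0 - s)

# The derivative peaks at x = 0 and is never larger than 0.25.
peak = sigmoid_grad(0.0)

# Far from 0 the function saturates and the derivative is nearly 0.
saturated = sigmoid_grad(10.0)

# Best case for a chain of 10 sigmoid layers (illustrative depth):
# even if every unit sits at the peak, the factors multiply to 0.25**10.
depth = 10
best_case_factor = peak ** depth

# Outputs are always positive, so they are not centered at 0.
negative_input_output = sigmoid(-5.0)

print(peak, saturated, best_case_factor, negative_input_output)
```

Because each layer contributes a factor of at most 0.25, the gradient signal shrinks geometrically with depth, which is one way to read the "not suitable for deep networks" remark.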
2. Tanh function
Advantages:
<1> The function's output is centered at (0, 0).
Disadvantages:
<1> Tanh does not solve Sigmoid's vanishing-gradient problem.
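Both Tanh properties listed above can be illustrated in a few lines. The sketch below assumes a small symmetric set of sample inputs chosen purely for illustration; `tanh_grad` is a hypothetical helper name.

```python
import math

def tanh_grad(x):
    """Derivative of tanh: 1 - tanh(x)^2."""
    t = math.tanh(x)
    return 1.0 - t * t

# Symmetric sample inputs (illustrative values).
inputs = [-2.0, -1.0, 0.0, 1.0, 2.0]
outputs = [math.tanh(x) for x in inputs]

# Outputs are symmetric around 0, so their mean is (numerically) zero.
mean_output = sum(outputs) / len(outputs)

# The gradient is 1 at the origin but collapses once the unit saturates.
grad_at_zero = tanh_grad(0.0)
grad_saturated = tanh_grad(5.0)

print(mean_output, grad_at_zero, grad_saturated)
```

The peak derivative of 1 is larger than Sigmoid's 0.25, but the derivative still approaches 0 for large inputs, so saturation remains a problem.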
3. ReLU function
Advantages:
<1> Under SGD, it converges much faster than Sigmoid and Tanh.
<2> It effectively alleviates the vanishing-gradient problem.
Disadvantages:
<1> Neurons can easily "die" during training (on the negative half-axis): the gradient stays at 0, and the death is irreversible.
<2> The derivative is 1 on the positive half-axis, which alleviates vanishing gradients but makes exploding gradients more likely.
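The dead-neuron problem, and why PReLU (mentioned earlier as the remedy) helps, can be sketched with a single neuron. Everything below is an illustrative assumption: the weight `w`, the large negative bias `b`, the sample inputs, and the fixed negative slope `a = 0.25`. In a real PReLU layer the slope is a learned parameter.

```python
def relu_grad(z):
    """Derivative of ReLU with respect to its input z."""
    return 1.0 if z > 0 else 0.0

def prelu_grad(z, a):
    """Derivative of PReLU with respect to z, with negative-side slope a."""
    return 1.0 if z > 0 else a

# Hypothetical neuron z = w * x + b whose bias has drifted far negative.
w, b = 0.5, -10.0
inputs = [0.0, 1.0, 2.0, 3.0]
pre_activations = [w * x + b for x in inputs]

# Gradient of the activation output with respect to w is f'(z) * x.
relu_weight_grads = [relu_grad(z) * x for z, x in zip(pre_activations, inputs)]

a = 0.25  # illustrative fixed slope; PReLU normally learns this value
prelu_weight_grads = [prelu_grad(z, a) * x for z, x in zip(pre_activations, inputs)]

print(relu_weight_grads)
print(prelu_weight_grads)
```

Every pre-activation here is negative, so the ReLU neuron receives a zero gradient for all inputs and its weight can never be updated again, while the PReLU variant still passes a scaled gradient through.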
4. ReLU improvements
Copyright notice
This article was written by [moletop]. Please include a link to the original when reposting. Thank you.
https://yzsam.com/2022/04/202204231523160750.html