Tips for Tuning Parameters in Deep Learning
2022-04-23 15:27:00 【moletop】
How to tune the parameters:
- Choose a suitable batch size.
- Choose a suitable number of epochs: watch how training converges and stop before the model overfits.
- Decide whether to add batch normalization.
- Decide whether dropout is needed.
- Choice of activation function: except in places such as gates, where the output has to be limited to the range 0 to 1, try not to use sigmoid; tanh or relu can be used instead. 1. The sigmoid function only has a noticeable gradient in the interval from about -4 to 4 (its derivative peaks at 0.25 at an input of 0). Outside that range the gradient is close to 0, which easily causes vanishing gradients. 2. Even when the input has zero mean, the output of sigmoid does not have zero mean.
- Loss function: train one round with a regularization term and one round without it.
- Choice of optimizer: on small datasets, adaptive optimizers such as adam and adadelta tend to give worse experimental results than sgd. sgd converges more slowly, but the result it finally converges to is generally better. If you use sgd, you can start with a learning rate of 1.0 or 0.1, check the validation set after a while, and halve the learning rate if the cost is no longer decreasing. Many papers do this, and the experimental results are good. You can also start with one of the ada-family optimizers and switch to sgd toward the end of training, which can also bring an improvement. Anecdotally, adadelta tends to work better on classification problems and adam on generation problems.
- Ensembling, which can be done in several ways:
- The same parameters with different initialization methods.
- Different parameters: use cross-validation to select the best few sets.
  A detailed explanation of k-fold cross-validation: https://www.cnblogs.com/henuliulei/p/13686046.html
- The same parameters at different stages of training, that is, models saved after different numbers of iterations.
- Different models combined by linear fusion, for example an RNN and a traditional model.
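The linear fusion mentioned in the ensembling tips can be sketched as a weighted average of the class probabilities that each model predicts. This is only an illustration: the function name `linear_fusion`, the three sets of probabilities, and the weights are all made up for the example, and in practice the weights would be chosen on a validation set.

```python
from typing import List, Sequence


def linear_fusion(predictions: Sequence[Sequence[float]],
                  weights: Sequence[float]) -> List[float]:
    """Blend per-model class probabilities with a weighted average."""
    if len(predictions) != len(weights):
        raise ValueError("need exactly one weight per model")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    n_classes = len(predictions[0])
    fused = [0.0] * n_classes
    for probs, w in zip(predictions, weights):
        if len(probs) != n_classes:
            raise ValueError("all models must predict the same classes")
        for i, p in enumerate(probs):
            # Normalize the weights so the fused values stay probabilities.
            fused[i] += (w / total) * p
    return fused


# Hypothetical outputs of three models for one sample with three classes.
rnn_probs = [0.6, 0.3, 0.1]
tree_probs = [0.2, 0.7, 0.1]
linear_probs = [0.5, 0.4, 0.1]

fused = linear_fusion([rnn_probs, tree_probs, linear_probs], [0.5, 0.3, 0.2])
# Index of the class with the highest fused probability.
predicted_class = max(range(len(fused)), key=fused.__getitem__)
print(fused, predicted_class)
```

The same averaging applies to the other ensemble variants in the list: the inputs can just as well be predictions from differently initialized runs or from checkpoints saved at different iterations.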
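The sgd advice above (start from a fairly large learning rate, check the validation set, and halve the rate when the cost stops decreasing) can be sketched as a small schedule object. This is a minimal sketch under assumptions: the class name `HalveOnPlateau`, the `min_delta` parameter, and the validation costs are hypothetical, and the actual training loop is left out.

```python
class HalveOnPlateau:
    """Halve the learning rate when the validation cost stops decreasing."""

    def __init__(self, initial_lr: float = 0.1, min_delta: float = 0.0):
        self.lr = initial_lr
        self.min_delta = min_delta        # smallest drop that counts as progress
        self.best_cost = float("inf")     # lowest validation cost seen so far

    def step(self, val_cost: float) -> float:
        """Record one validation check and return the learning rate to use next."""
        if val_cost < self.best_cost - self.min_delta:
            self.best_cost = val_cost     # still improving: keep the current rate
        else:
            self.lr *= 0.5                # no decrease: cut the rate in half
        return self.lr


schedule = HalveOnPlateau(initial_lr=0.1)
# Made-up validation costs, one per validation check.
val_costs = [0.90, 0.70, 0.72, 0.65, 0.65]
lrs = [schedule.step(cost) for cost in val_costs]
print(lrs)
```

In a real training run, the value returned by `step` would be written into the optimizer before the next round of updates. Frameworks usually ship an equivalent scheduler, for example `torch.optim.lr_scheduler.ReduceLROnPlateau` in PyTorch with `factor=0.5`.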
Copyright notice
This article was written by [moletop]. Please include a link to the original when reposting. Thank you.
https://yzsam.com/2022/04/202204231523160668.html