Deep Learning: Hyperparameter Settings
2022-04-23 15:08:00 【請叫我做雷锋】
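I. Overfitting
1. Definition: Given a hypothesis space H and a hypothesis h in H, h is said to overfit the training data if there exists another hypothesis h' in H such that h has a lower error rate than h' on the training examples, but h' has a lower error rate than h over the entire distribution of instances.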
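2. Plain-language explanation: the model fits the training data so closely, noise included, that it does well on the training set but poorly on data it has not seen.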
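The definition can be made concrete with a small experiment. The sketch below is only an illustration under stated assumptions: the one-dimensional data, the 20% label noise, the threshold at 0.5, and all function names are hypothetical and not from the original post. Here h is a 1-nearest-neighbour classifier that memorizes every training label, and h' is a simple threshold rule.

```python
import random

def true_rule(x):
    """Noise-free concept assumed for this demo: class 1 when x > 0.5."""
    return 1 if x > 0.5 else 0

def sample(n, rng, noise=0.2):
    """Draw n points from [0, 1) and flip each label with probability `noise`."""
    data = []
    for _ in range(n):
        x = rng.random()
        y = true_rule(x)
        if rng.random() < noise:
            y = 1 - y
        data.append((x, y))
    return data

def make_h(train):
    """h: 1-nearest-neighbour, which memorizes every training label, noise included."""
    def h(x):
        return min(train, key=lambda p: abs(p[0] - x))[1]
    return h

def h_prime(x):
    """h': the simple threshold rule, which ignores the label noise."""
    return true_rule(x)

def error_rate(hypothesis, data):
    """Fraction of points in `data` that the hypothesis labels incorrectly."""
    return sum(hypothesis(x) != y for x, y in data) / len(data)

def run_demo(seed=0):
    rng = random.Random(seed)
    train = sample(300, rng)
    fresh = sample(3000, rng)  # stands in for the whole instance distribution
    h = make_h(train)
    return {
        "h_train": error_rate(h, train),
        "h_fresh": error_rate(h, fresh),
        "h_prime_train": error_rate(h_prime, train),
        "h_prime_fresh": error_rate(h_prime, fresh),
    }

if __name__ == "__main__":
    for name, err in run_demo().items():
        print(f"{name}: {err:.3f}")
```

In this setup h should make fewer errors than h' on the training set and more errors than h' on the fresh sample, which is the pattern the definition calls overfitting.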
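3. Common causes
The main causes are overtraining and imbalanced sample features. Broken down further, they also include the following (not an exhaustive list):
(1) The modeling samples were selected incorrectly, the sample labels are wrong, or similar, so the selected sample data cannot represent the intended classification rule.
(2) The samples are too noisy, so the model learns the noise and treats it as a feature, which disrupts the intended classification rule.
(3) The assumed model is not reasonable, or the conditions under which the assumption holds are not actually met.
(4) There are too many parameters, so model complexity is too high.
(5) For tree-based models, if depth and splits are not reasonably constrained, nodes may end up containing only event data or only non-event data (no event). The tree then matches (fits) the training data perfectly but cannot adapt to other datasets.
(6) For neural network models: 1) the weights are trained for too many iterations (overtraining); 2) the BP (backpropagation) algorithm may make the weights converge to an overly complex decision surface.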
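4. Solutions
-> Model side: for neural networks, add dropout or batch normalization; for tree-based models, limit the depth, add regularization terms, and so on; set early-stopping conditions.
-> Data side: enlarge the dataset; apply data augmentation to the dataset.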
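Of the remedies listed above, early stopping is the easiest to show in a few lines. The following is a minimal sketch and not the author's method: the function name `early_stopping_epoch` and the `patience` and `min_delta` parameters are assumptions made for illustration. It scans a list of per-epoch validation losses and reports where training would stop.

```python
def early_stopping_epoch(val_losses, patience=3, min_delta=0.0):
    """Return (stop_epoch, best_epoch) for a sequence of validation losses.

    Training stops once the validation loss has failed to improve by more
    than `min_delta` for `patience` consecutive epochs. Epochs are 0-based.
    """
    best_loss = float("inf")
    best_epoch = -1
    waited = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss - min_delta:
            # New best: remember it and reset the patience counter.
            best_loss, best_epoch, waited = loss, epoch, 0
        else:
            waited += 1
            if waited >= patience:
                return epoch, best_epoch
    # The loss kept improving (or patience never ran out): use all epochs.
    return len(val_losses) - 1, best_epoch

if __name__ == "__main__":
    losses = [1.0, 0.8, 0.7, 0.72, 0.75, 0.9, 1.1]
    print(early_stopping_epoch(losses, patience=2))
```

In practice the weights saved at `best_epoch` would be restored after stopping, so the model that is kept is the one from before the validation loss started to rise.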
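II. Regularization
Prerequisite (gradient descent): https://zhuanlan.zhihu.com/p/113714840
1. Purpose of regularization: a term that sums over the weights, added so that the model generalizes better.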
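The post does not say which form the weight term takes. As one common choice, the sketch below assumes an L2 penalty, lam * sum(w^2), added to the data loss. The function names and the `lam` and `lr` parameters are hypothetical. An L1 penalty would sum absolute values instead.

```python
def l2_penalty(weights, lam):
    """Sum of squared weights scaled by lam (the assumed regularization term)."""
    return lam * sum(w * w for w in weights)

def regularized_loss(data_loss, weights, lam):
    """Total objective: data loss plus the weight penalty."""
    return data_loss + l2_penalty(weights, lam)

def gradient_step(weights, data_grads, lr, lam):
    """One gradient-descent update on the regularized objective.

    The gradient of lam * w^2 with respect to w is 2 * lam * w, so each
    weight is pulled toward zero in addition to following the data gradient.
    """
    return [w - lr * (g + 2.0 * lam * w) for w, g in zip(weights, data_grads)]

if __name__ == "__main__":
    w = [1.0, -2.0]
    print(regularized_loss(1.0, w, lam=0.1))
    print(gradient_step(w, [0.0, 0.0], lr=0.1, lam=0.5))
```

Even with a zero data gradient, the update shrinks every weight, which is why this penalty discourages the overly complex fits described in the overfitting section.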
Copyright notice
This article was written by [請叫我做雷锋]. Please include the original link when reposting. Thank you.
https://blog.csdn.net/weixin_44646187/article/details/124341309