当前位置:网站首页>Deep learning - Super parameter setting
Deep learning - Super parameter setting
2022-04-23 15:18:00 【Please call me Lei Feng】
One . Over fitting
1. Definition : Given a hypothetical space H, A hypothesis h Belong to H, If there are other assumptions h’ Belong to H, So in the training example h The error rate of h’ Small , But in the whole instance distribution h’ Than h Low error rate , So let's say suppose h Overfitting training data .
2. Popular explanation

3. Common causes
It is mainly over learning and unbalanced sample characteristics , If the segment , It can also include ( Not all the reasons ):
(1) Wrong modeling sample selection , Sample label error, etc , Causes the selected sample data to be insufficient to represent the intended classification rule
(2) Excessive sample noise interference , Make the machine learn the noise , Also considered a feature , Thus disturbing the preset classification rules
(3) The hypothetical model cannot reasonably exist , Or the conditions under which the hypothesis is true are not true (4) Too many parameters , Excessive model complexity
(5) about tree-based Model , If we compare its depth with split There are no reasonable restrictions , It is possible to make the node contain only simple event data (event) Or non-event data (no event), Make it a perfect match though ( fitting ) Training data , But it can't adapt to other data sets
(6) For the neural network model :1). Too many iterations of weight learning (Overtraining),2).BP The algorithm may make the weight converge to the decision surface which is too complex .
4. resolvent
-> Model : neural network : Add dropout,batch normalization Tree based model : Limit depth , Add regularization items and set early termination conditions .
-> Data on : Increase the data set and enhance the data set (augmentation).
Two 、 Regularization
Preliminary knowledge ( Gradient descent method ):https://zhuanlan.zhihu.com/p/113714840
1. The purpose of regularization : A weight accumulation term added for the generalization of the model .
版权声明
本文为[Please call me Lei Feng]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/04/202204231508367130.html
边栏推荐
- Educational Codeforces Round 127 A-E题解
- Practice of unified storage technology of oppo data Lake
- Collation of errors encountered in the use of redis shake
- setcontext getcontext makecontext swapcontext
- 【thymeleaf】处理空值和使用安全操作符
- C语言超全学习路线(收藏让你少走弯路)
- 机器学习——逻辑回归
- Leetcode162 - find peak - dichotomy - array
- Have you really learned the operation of sequence table?
- Design of digital temperature monitoring and alarm system based on DS18B20 single chip microcomputer [LCD1602 display + Proteus simulation + C program + paper + key setting, etc.]
猜你喜欢

8.2 text preprocessing

Mysql连接查询详解

Basic operation of circular queue (Experiment)

asp. Net method of sending mail using mailmessage

UML学习_day2

Advanced version of array simulation queue - ring queue (real queuing)

Leetcode162 - find peak - dichotomy - array

8.3 language model and data set

win10 任务栏通知区图标不见了

Introduction to distributed transaction Seata
随机推荐
Detailed explanation of C language knowledge points -- first understanding of C language [1] - vs2022 debugging skills and code practice [1]
js——實現點擊複制功能
API gateway / API gateway (III) - use of Kong - current limiting rate limiting (redis)
The wechat applet optimizes the native request through the promise of ES6
Differential privacy (background)
Kubernetes详解(十一)——标签与标签选择器
After time judgment of date
The life cycle of key value in redis module programming
How to upload large files quickly?
Introduction to distributed transaction Seata
C language super complete learning route (collection allows you to avoid detours)
Three uses of kprobe
Redis cluster principle
【thymeleaf】处理空值和使用安全操作符
Mysql连接查询详解
X509 certificate cer format to PEM format
API gateway / API gateway (II) - use of Kong - load balancing
Practice of unified storage technology of oppo data Lake
A series of problems about the best time to buy and sell stocks
Alexnet model