
3.1-Classification-probabilistic generative model


1. Classification

  • For a classification task, as with regression, the goal is to find a function f whose input is an object x and whose output is a class label.
  • There are many such classification tasks in practice, for example:
    • Credit scoring
      • Input: income, savings, occupation, age, past financial record
      • Output: accept or reject
    • Medical diagnosis
      • Input: current symptoms, age, gender, past medical records
      • Output: which disease
    • Handwritten character recognition
      • Input: a handwritten character
      • Output: the corresponding character (as a numeric code)
    • Face recognition
      • Input: a face image
      • Output: the corresponding person

1.1 Application Example

  • The following example classifies Pokémon. Every Pokémon can be described by 7 attributes: Total, HP, Attack, Defense, SP Atk, SP Def, Speed. Below we use these attributes to predict which type a Pokémon belongs to.
  • Suppose we have not yet learned how to treat this as a classification problem and instead attack it with regression, and see what happens. Take binary classification as an example:
    • The regression model shown below penalizes examples that are "too correct", i.e. points whose output values are far beyond the target, so the boundary it produces is poor.
[Figure: regression applied to a binary classification problem]
  • Ideal Alternatives
    • Define a model g(x): for an input x, predict class 1 if g(x) > 0, otherwise predict class 2.
    • The loss function counts how many times the prediction is wrong on the training set (a small sketch follows the figure below).
    • Ways to find a solution include the perceptron, support vector machines (SVM), and generative models.
[Figure: ideal alternatives - the model g(x), the 0/1 loss, and methods such as the perceptron and SVM]
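As a rough illustration of the 0/1 loss above, here is a minimal Python sketch; the helper name `zero_one_loss` and the label convention (1 or 2) are assumptions made for this note, not from the original post:

```python
import numpy as np

def zero_one_loss(g, xs, ys):
    """Count how many examples the model g misclassifies.

    g(x) > 0 is read as class 1, otherwise class 2; ys holds labels 1 or 2.
    """
    preds = np.array([1 if g(x) > 0 else 2 for x in xs])
    return int(np.sum(preds != np.asarray(ys)))
```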
  • Generative Model

    1. A generative model estimates the probability P(x) that a sample x occurs; it is called "generative" because once P(x) is known we could generate samples ourselves. P(x) is built from the priors and the class-conditional probabilities, as written out below.

[Figures: the generative model]
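For reference, the quantities involved can be written out explicitly (this is the standard two-class Bayes decomposition that the rest of the section relies on):

$$
P(C_1 \mid x) = \frac{P(x \mid C_1)\,P(C_1)}{P(x \mid C_1)\,P(C_1) + P(x \mid C_2)\,P(C_2)},
\qquad
P(x) = P(x \mid C_1)\,P(C_1) + P(x \mid C_2)\,P(C_2)
$$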

    2. Prior: P(C1) and P(C2) can be computed directly from the existing training set, simply by counting (see the formula below).

[Figures: estimating the priors P(C1) and P(C2) from the training set]
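Concretely, if the training set contains N1 examples of class C1 and N2 examples of class C2 (the same N1 and N2 that reappear in the summary), the priors are just the class frequencies:

$$
P(C_1) = \frac{N_1}{N_1 + N_2}, \qquad P(C_2) = \frac{N_2}{N_1 + N_2}
$$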

    3. Gaussian distribution: every Pokémon can be represented by a vector of its attributes; below we use the two-dimensional vector (Defense, SP Def) to represent each Pokémon. For a new Pokémon in the test set (the turtle in the lecture's example) we cannot obtain the probability P(x|C1) by counting, because it never appears in the training set. We therefore assume the existing training examples were sampled from a Gaussian distribution; with that assumption we can estimate the probability of the new point.

       For a Gaussian distribution the input is a vector x and the output is the probability density

       $$f_{\mu,\Sigma}(x) = \frac{1}{(2\pi)^{D/2}\,|\Sigma|^{1/2}} \exp\!\left(-\frac{1}{2}(x-\mu)^{T}\Sigma^{-1}(x-\mu)\right),$$

       whose shape is determined by the mean μ and the covariance matrix Σ.

[Figures: Gaussian distributions with different means μ and covariance matrices Σ]
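A minimal numpy sketch of this density; the function name `gaussian_density` is assumed here for illustration only:

```python
import numpy as np

def gaussian_density(x, mu, sigma):
    """Multivariate Gaussian density f_{mu,Sigma}(x).

    x, mu: 1-D arrays of length D; sigma: a D x D covariance matrix.
    """
    d = len(mu)
    diff = x - mu
    norm = 1.0 / np.sqrt((2 * np.pi) ** d * np.linalg.det(sigma))
    return norm * np.exp(-0.5 * diff @ np.linalg.inv(sigma) @ diff)
```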

    4. Maximum Likelihood: each point is sampled independently from a Gaussian, and these 79 points could have been generated by many different Gaussians, each assigning them a different likelihood. We therefore look for the Gaussian (μ*, Σ*) with the highest likelihood:

       $$L(\mu, \Sigma) = \prod_{n=1}^{79} f_{\mu,\Sigma}(x^{n}), \qquad \mu^{*}, \Sigma^{*} = \arg\max_{\mu,\Sigma} L(\mu, \Sigma)$$

       The figures below show the mean μ and covariance matrix Σ actually computed for the water-type and normal-type Pokémon (a code sketch follows the figures).

[Figures: maximum-likelihood estimates of μ and Σ for the water-type and normal-type Pokémon]
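The maximizers have a closed form: the sample mean and the 1/N-normalized sample covariance. A small sketch, with the helper name `fit_gaussian_mle` made up for this note:

```python
import numpy as np

def fit_gaussian_mle(X):
    """Maximum-likelihood Gaussian fit to the rows of X.

    X: array of shape (N, D), one feature vector per training example.
    Returns the sample mean mu* and the covariance Sigma* (1/N, not 1/(N-1)).
    """
    mu = X.mean(axis=0)
    diff = X - mu
    sigma = diff.T @ diff / X.shape[0]
    return mu, sigma
```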

2. Prediction

  • The figure on the right shows the accuracy of the model's predictions on the test set: with only 2 attributes the accuracy is 47%, and with all 7 attributes it is 54%. Clearly the model performs poorly and needs further optimization (a sketch of how a single prediction is made follows the figures).

[Figures: test-set accuracy with 2 attributes (47%) and with all 7 attributes (54%)]
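To make the prediction step concrete, here is a hedged sketch of how the posterior could be computed once the priors and the two class Gaussians have been estimated; the function name `posterior_c1` is invented here, and `scipy.stats.multivariate_normal` stands in for the density defined above:

```python
from scipy.stats import multivariate_normal

def posterior_c1(x, mu1, sigma1, mu2, sigma2, p_c1, p_c2):
    """P(C1 | x) by Bayes' rule with Gaussian class-conditional densities."""
    p_x_c1 = multivariate_normal.pdf(x, mean=mu1, cov=sigma1)
    p_x_c2 = multivariate_normal.pdf(x, mean=mu2, cov=sigma2)
    return p_x_c1 * p_c1 / (p_x_c1 * p_c1 + p_x_c2 * p_c2)

# x is assigned to class 1 whenever P(C1 | x) > 0.5, otherwise to class 2.
```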

  • Modifying the Model: a common fix is to let the different classes share the same covariance matrix Σ. Sharing Σ reduces the number of parameters and hence the model's variance, giving a simpler model. μ1 and μ2 are computed exactly as before, while the shared Σ is the weighted average of the original Σ1 and Σ2 (see the sketch after the figures). The accuracy rises from 54% to 73%.

[Figures: the model with a shared covariance matrix Σ and the resulting 73% accuracy]
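A minimal sketch of the shared covariance, assuming the two classes' training matrices are available; `shared_covariance` is a name invented for this note:

```python
import numpy as np

def shared_covariance(X1, X2):
    """Weighted average of the two per-class covariance estimates.

    X1, X2: arrays of shape (N1, D) and (N2, D) for the two classes.
    """
    n1, n2 = len(X1), len(X2)
    sigma1 = np.cov(X1, rowvar=False, bias=True)  # bias=True -> 1/N normalization
    sigma2 = np.cov(X2, rowvar=False, bias=True)
    return (n1 * sigma1 + n2 * sigma2) / (n1 + n2)
```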

3. Summary

  • The figure on the left recaps the three-step recipe (model, loss, optimization) as applied to classification; the figure on the right notes that the probability distribution does not have to be Gaussian: if the features are binary-valued, Bernoulli distributions can be used, and if all dimensions are assumed to be independent we obtain the Naive Bayes classifier (a sketch follows the figures).

[Figures: the three steps for classification; alternative probability distributions (Bernoulli, Naive Bayes)]
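As a minimal sketch of the independence assumption, the class-conditional probability factorizes over the dimensions; here each dimension is modeled by an independent 1-D Gaussian, and the helper name is made up for illustration:

```python
import numpy as np
from scipy.stats import norm

def naive_bayes_likelihood(x, mus, stds):
    """P(x | C) under the naive independence assumption.

    x, mus, stds: 1-D arrays of length D; each dimension d contributes an
    independent 1-D Gaussian factor N(x_d; mus[d], stds[d]).
    """
    return float(np.prod(norm.pdf(x, loc=mus, scale=stds)))
```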

  • Posterior Probability: after a pile of rather tedious algebra we arrive at the final expression for P(C1|x) (written out below). But to obtain w and b, the generative model estimates N1, N2, μ1, μ2 and Σ - quite a lot of parameters - which feels roundabout. Why not search for w and b directly from the start? We will dig into this question in the next chapter, on logistic regression.

[Figures: deriving the posterior probability P(C1|x)]
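For completeness, the result of that derivation with a shared covariance Σ takes the familiar linear form (this is the standard closed form; the notation matches the parameters listed above):

$$
P(C_1 \mid x) = \sigma(z) = \frac{1}{1 + e^{-z}}, \qquad z = w \cdot x + b,
$$

$$
w^{T} = (\mu_1 - \mu_2)^{T}\,\Sigma^{-1}, \qquad
b = -\tfrac{1}{2}\mu_1^{T}\Sigma^{-1}\mu_1 + \tfrac{1}{2}\mu_2^{T}\Sigma^{-1}\mu_2 + \ln\frac{N_1}{N_2}
$$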