当前位置:网站首页>Seven Steps to Hands-On Machine Learning
Seven Steps to Hands-On Machine Learning
2022-08-08 01:08:00 【Crazy jack Bauer】
Seven Steps to Hands-On Machine Learning
- Collecting data
- Prepare data
- Select a model
- Training
- Assessment
- Parameter adjustment
- Prediction
Let's use an example to see how each step works.
Case: Distinguishing between wine and beer
Collecting data
We bought a bunch of different types of beer and red wine at the supermarket, marked all the wines with their color and alcohol content, and recorded the data.This step is very important because the quantity and quality of the data directly determines the quality of the predictive model.
Data Preparation
In this example, our data is very neat, but in practice, the data we collect will have many problems, so it will involve data cleaning and other work.
When there is no problem with the data itself, we divide the data into 3 parts:
- Training set (60%): used to train the model
- Validation Set (20%): Make sure the model is not overfit
- Test set (20%): used to evaluate model performance
Select a model
Researchers and data scientists have created many models over the years.Some are great for image data, some are great for sequences (such as text or music), some are for numerical data, and some are for text-based data.
In our case, since we only have 2 features, color and alcohol, we can use a small linear model, which is a fairly simple one.
Training
Most people think this is the most important part, but it's not the case~ The quantity and quality of data, and the choice of model are more important than the training itself (3 minutes on the training knowledge desk, more important is10 years in the audience).
This process does not require human participation, the machine can complete it independently, and the whole process is like doing an arithmetic problem.Because the essence of machine learning is the process of transforming problems into mathematical problems and then solving them.
Assessment
Once the training is complete, you can evaluate whether the model is useful.This is where our previously reserved validation and test sets come into play.The evaluation indicators mainly include precision rate, recall rate, and F value.
This process allows us to see how the model is making predictions about numbers we haven't seen yet.This is meant to represent how the model will perform in the real world.
Parameter adjustment
After the evaluation is complete, we can further improve the training by adjusting the parameters.When we train, we implicitly assume some parameters that we can adjust to make the model perform better.
Forecast
Our 6 steps above are all for this step.This is also the value of machine learning.At this time, when we buy a new bottle of wine, just tell the machine its color and alcohol content, and he will tell you that the beer is still red wine.
References
https://easyai.tech/ai-knowledge-hub/
边栏推荐
猜你喜欢

机器学习的基础概念

leetcode 30. 串联所有单词的子串

【STM32】标准库-看门狗

sci 顶刊中的 3D 密度函数图

Enterprise learning (11) to build the main branch, trigger, nailing notice

(Note)航世BOW G19键盘 —— 使用说明书

BOSS direct hire again explodes erotic recruitment scandal, interviewer's behavior is unsightly

PAT甲级 1059 Prime Factors

kaggle小白必看:小白常见的2个错误解决方案

Distributed agreement of Lease mechanism
随机推荐
Rdkit常用命令/参数设置
第三章 矩阵压缩
SwiftUI使用MatchedGeometryEffect快速同步不同继承层级中视图的位置和尺寸
Octopus Application Chain|Content Curation Collaboration Organization DISCOVOL DAO Mainnet Launched in August
已知某路口发生事故的频率是平均每天2次,求此处一天内发生4次事故的概率。
陈强教授《机器学习及R应用》课程第十一章作业
Win11右键恢复经典win10最简单方法(基本0操作)
立秋是中稻收割的日子
手机股票开户是安全的吗,靠谱吗?
Chapter 3 Matrix Compression
Detailed QML ListView
每周小结(*68):为什么要打架
【程序员的七夕浪漫时刻】
正则表达式
2021 RoboCom 世界机器人开发者大赛-本科组(决赛)7-1绿色围栏(模拟)
Embedded sharing collection 31-serial port
Stream did not contain valid UTF-8 ——vs code 中 rust在debug时报错,中文乱码问题
OpenFeign简介和使用详解
如何将 Tex 转化为 Word 文件
【神器】MarkDown-沉浸写作的利器