当前位置:网站首页>Explanation of input components in Chapter 16
Explanation of input components in Chapter 16
2022-04-23 13:24:00 【Feige big data】
introduction
In the last article , We introduced :XML input stream (StAX) Various detailed settings of components , Finally, the actual combat demonstrates how to operate it to read the data on the disk xml file 、 And the operation of cleaning data .
In this article , Let's go on to introduce :kettle Input components in , Previously, we introduced some common input components . This article expands to explain some uncommon components , These components are very simple to operate , So I'll just show you how to operate , I won't explain the options in detail as before , You can see how to use .
transformation
transformation (transaformation) yes ETL The main part of the solution , It handles extraction 、 transformation 、 Loading various operations on data lines .
Create transformations
What we have to do ETL operation , It's all designed in transformation , So we need to create a transformation first .
Save conversion
Give you a new conversion , Name it , And save
Generate records
brief introduction
You need to turn part of the text data into data lines , Each field acts as a column of a data row , Then you can use this component
Preview the record
Generate random number
brief introduction
In the use of Kettle During data warehousing , It often involves for ID Field generates a unique identifier , and UUID It's usually our choice to generate a unique representation , This is the time to use “ Generate random number ” Functional operators , adopt “ Generate random number ” Can generate UIID.
On a daily basis , Sometimes I just want to test with some data , But I don't want to get it from the database or Excel Input , Then you can generate random numbers .
Preview the record ( Execute many times )
You can see , The result of each run is different , Because every time it's a random number .
Custom constant data
brief introduction
Custom constant data , It's generation key-value Constant data in the form of .
Preview the record
Get system information
brief introduction
System information refers to Kettle Information about the system environment , Include the date of the computer system , Time type information such as week , Computer name 、IP Address 、 Start and end time of conversion or job ( Used for incremental data processing )、 Information about the completion of the previous job and command line parameters, etc .
By obtaining system information components , We can get these data .
choice Type of system information
List Kettle All types of information are supported
Customize several system changes
Preview the record
Conclusion
This article mainly explains : Generate record component 、 Generate random number components 、 Custom constant data component 、 Get system information components .
As of this article ,kettle Input components for , All the commonly used basics are introduced . The main explanation in the follow-up is :kettle The output components of .
brother , In fact, there is a distance between thinking and acting , If you think about it, it's gone , But you're doing it , It landed .
Don't say anything , Brothers, follow me and it's over , We still break up the way of kneading to say . The following content is more wonderful , Coming soon , Thank you for your attention !!
版权声明
本文为[Feige big data]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/04/202204231151524383.html
边栏推荐
- 100 GIS practical application cases (53) - making three-dimensional image map as the base map of urban spatial pattern analysis
- Imx6ull QEMU bare metal tutorial 2: usdhc SD card
- You and the 42W bonus pool are one short of the "Changsha bank Cup" Tencent yunqi innovation competition!
- Common interview questions and detailed analysis of the latest Android developers in 2020
- [dynamic programming] 221 Largest Square
- RTOS mainstream assessment
- Machine learning -- model optimization
- 100 GIS practical application cases (52) - how to keep the number of rows and columns consistent and aligned when cutting grids with grids in ArcGIS?
- FatFs FAT32 learning notes
- Conflict between Mui picker and drop-down refresh
猜你喜欢
Imx6ull QEMU bare metal tutorial 2: usdhc SD card
Solve the problem that Oracle needs to set IP every time in the virtual machine
AUTOSAR from introduction to mastery 100 lectures (52) - diagnosis and communication management function unit
CMSIS cm3 source code annotation
Learning notes of AMBA protocol
AUTOSAR from introduction to mastery 100 lectures (51) - AUTOSAR network management
C语言之字符串与字符数组的区别
[point cloud series] relationship based point cloud completion
nodeJs + websocket 循环小案例
CSDN高校俱乐部“名师高校行”——湖南师范大学站
随机推荐
Playwright contrôle l'ouverture de la navigation Google locale et télécharge des fichiers
[point cloud series] unsupervised multi task feature learning on point clouds
You and the 42W bonus pool are one short of the "Changsha bank Cup" Tencent yunqi innovation competition!
AUTOSAR from introduction to mastery 100 lectures (51) - AUTOSAR network management
"Play with Lighthouse" lightweight application server self built DNS resolution server
在 pytorch 中加载和使用图像分类数据集 Fashion-MNIST
mui 微信支付 排坑
Using open to open a file in JNI returns a - 1 problem
[point cloud series] so net: self organizing network for point cloud analysis
Introduction to metalama 4 Use fabric to manipulate items or namespaces
hbuilderx + uniapp 打包ipa提交App store踩坑记
[dynamic programming] 221 Largest Square
[Technical Specification]: how to write technical documents?
2021年6月程序员工资统计,平均15052元,你拖后腿了吗?
TERSUS笔记员工信息516-Mysql查询(2个字段的时间段唯一性判断)
【动态规划】221. 最大正方形
Esp32 vhci architecture sets scan mode for traditional Bluetooth, so that the device can be searched
ECDSA signature verification principle and C language implementation
vscode小技巧
playwright控制本地谷歌浏览打开,并下载文件