Chapter 16: Explanation of input components
2022-04-23 13:24:00 【Feige big data】
Introduction
In the last article, we introduced the detailed settings of the XML input stream (StAX) component, and finished with a hands-on demonstration of using it to read an XML file from disk and clean the data.
In this article, we continue with Kettle's input components. Earlier articles covered the common ones; this one extends to some less common components. They are very simple to operate, so I will only show how to use them rather than explain every option in detail as before.
Transformation
A transformation is the main part of an ETL solution. It handles the extraction, transformation, and loading operations performed on rows of data.
Create a transformation
All the ETL operations we want to perform are designed inside a transformation, so we need to create one first.
Save the transformation
Give the new transformation a name and save it.
Generate records
Brief introduction
When you need to turn some text data into rows of data, with each field acting as a column of the row, you can use this component.
Preview the record
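To make the idea concrete outside of Kettle, here is a minimal Python sketch of what generating records amounts to. It assumes the component emits the same constant row a fixed number of times; the function name `generate_rows`, the `limit` parameter, and the sample field names are hypothetical and are not part of Kettle.

```python
from typing import Dict, Iterator


def generate_rows(fields: Dict[str, object], limit: int) -> Iterator[Dict[str, object]]:
    """Yield `limit` copies of one constant row; each key acts as a column."""
    for _ in range(limit):
        # Copy the dict so that each emitted row is an independent object.
        yield dict(fields)


# Hypothetical sample fields, chosen only for illustration.
rows = list(generate_rows({"id": 1, "name": "kettle", "city": "Beijing"}, limit=3))
for row in rows:
    print(row)
```

Each printed row has the same columns and values, which is what you would expect to see in a preview of constant rows.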
Generate random number
Brief introduction
When loading a data warehouse with Kettle, you often need to generate a unique identifier for an ID field, and a UUID is the usual choice. This is where the "Generate random number" component comes in: it can generate a UUID.
In day-to-day work, you sometimes just want some data to test with, without pulling it from a database or an Excel input. In that case you can generate random numbers.
Preview the records (executed several times)
As you can see, the result differs on every run, because the values are random.
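The same behavior can be sketched in plain Python. This is only an illustration of the concept, not of how Kettle implements it; the function `random_row` and its field names are hypothetical, and the UUID here is a version 4 (random) UUID from Python's standard library.

```python
import random
import uuid


def random_row() -> dict:
    """Build one row with a random UUID and two random numbers."""
    return {
        "id": str(uuid.uuid4()),                         # random (version 4) UUID
        "random_number": random.random(),                # float in [0.0, 1.0)
        "random_integer": random.randint(0, 2**31 - 1),  # bounds are inclusive
    }


# Two calls stand in for two separate runs; the values differ each time.
first_run = random_row()
second_run = random_row()
print(first_run)
print(second_run)
```

Because every value is drawn at random, two runs will not produce the same `id`, which is what makes a UUID usable as a unique identifier for an ID field.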
Custom constant data
Brief introduction
The custom constant data component generates constant data in key-value form.
Preview the record
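As a rough illustration, constant key-value data can be modeled as a fixed table of rows that is returned unchanged on every run. The column names `key` and `value`, the sample entries, and the function `constant_data` are assumptions made for this sketch; they do not come from Kettle.

```python
from typing import Dict, List

# Hypothetical constant rows, defined once at design time.
CONSTANT_ROWS: List[Dict[str, str]] = [
    {"key": "env", "value": "test"},
    {"key": "region", "value": "cn-north"},
    {"key": "batch_size", "value": "500"},
]


def constant_data() -> List[Dict[str, str]]:
    """Return a fresh copy of the constant rows; the content never changes."""
    return [dict(row) for row in CONSTANT_ROWS]


for row in constant_data():
    print(row["key"], "=", row["value"])
```

Unlike random numbers, this data is identical on every run, which makes it handy for lookup tables and fixed test inputs.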
Get system information
Brief introduction
System information refers to information about the environment Kettle runs in. It includes date and time information from the computer system (such as the date and the day of the week), the computer name, the IP address, the start and end times of the transformation or job (used for incremental data processing), information about the completion of the previous job, command-line arguments, and so on.
The Get system information component gives us access to this data.
Choose the type of system information
The list shows all the information types Kettle supports.
Define a few custom system information fields
Preview the record
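For comparison, several of the items listed above (system date, day of the week, computer name, IP address, command-line arguments) can be collected with Python's standard library. This is a sketch of the kind of data involved, not of Kettle's own mechanism; the function `get_system_info`, its field names, and the loopback fallback are assumptions made for illustration.

```python
import datetime
import socket
import sys


def get_system_info() -> dict:
    """Collect a few environment values into a single row."""
    now = datetime.datetime.now()
    hostname = socket.gethostname()
    try:
        ip_address = socket.gethostbyname(hostname)
    except OSError:
        # Assumption for this sketch: fall back to loopback if the name does not resolve.
        ip_address = "127.0.0.1"
    return {
        "system_date": now,
        "weekday": now.strftime("%A"),
        "hostname": hostname,
        "ip_address": ip_address,
        "command_line_args": list(sys.argv[1:]),
    }


info = get_system_info()
for name, value in info.items():
    print(name, "=", value)
```

Transformation or job start and end times are specific to Kettle's runtime and have no direct equivalent here, so they are left out of the sketch.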
Conclusion
This article covered four components: Generate records, Generate random number, Custom constant data, and Get system information.
With this article, the commonly used basics of Kettle's input components have all been introduced. The following articles will mainly cover Kettle's output components.
Friends, there is a gap between thinking and doing: an idea you only think about fades away, but one you act on becomes real.
Enough said. Follow along, and I will keep breaking each topic down into small, digestible pieces. More is coming soon. Thank you for following!
Copyright notice
This article was written by [Feige big data]. Please include the original link when reposting. Thank you.
https://yzsam.com/2022/04/202204231151524383.html