03-kylin插件
原始的datax是不能夠操作kylin數據源的,這個可以在datax的開源地址可以查看的到: https://github.com/alibaba/DataX ,但是kylin是支持使用標準jdbc 方式來訪問的,所以就可以使用datax提供的rdbmsreader 和 rdbmswriter 來往kylin裏面灌入數據。
步驟1:新建maven項目 引用kylin
<dependency>
<groupId>org.apache.kylin</groupId>
<artifactId>kylin-jdbc</artifactId>
<version>3.1.0</version>
</dependency>
步驟2:將kylin依賴包導入到本地
導出jar包到本地目錄:mvn dependency:copy-dependencies -DoutputDirectory=C:\\my\\out
步驟3:將本地kylin依賴jar包導入到datax插件目錄
步驟4:寫json
{
"job": {
"setting": {
"speed": {
"channel": 5,
"byte": 1048576
},
"errorLimit": {
"records": 1,
"percentage": 0
}
},
"content": [
{
"reader": {
"name": "rdbmsreader",
"parameter": {
"username": "uname",
"password": "pwd",
"connection": [
{
"querySql": [
"
select * from table
"],
"jdbcUrl": [
"jdbc:kylin://ip:7070/ddp_pro_BIReport"
]
}
],
"maxRetries": 3,
"splitPk": "id"
}
},
"writer": {
"name": "hbase11xwriter",
"parameter": {
"hbaseConfig": {
"hbase.zookeeper.quorum": ""
},
"table": "datamining:prediction_source_data",
"mode": "normal",
"rowkeyColumn": [
{
"index":1,
"type":"string"
},
{
"index":-1,
"type":"string",
"value":"#"
},
{
"index":0,
"type":"string"
}
],
"column": [
{
"index":2,
"name": "prediction:tc",
"type": "string"
}
],
"versionColumn":{
"index": -1,
"value":"123456789"
},
"encoding": "utf-8"
}
}
}
]
}
}
搞定!
參考:https://blog.csdn.net/supingemail/article/details/104420813