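03 - Kylin plugin
Stock DataX does not ship a plugin for Kylin data sources, as the plugin list in its open-source repository shows: https://github.com/alibaba/DataX . However, Kylin can be accessed through standard JDBC, so the generic rdbmsreader and rdbmswriter plugins that DataX provides can be pointed at Kylin instead. The job in step 4 uses rdbmsreader to read from Kylin.
Step 1: Create a new Maven project and add the Kylin JDBC dependency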
<dependency>
    <groupId>org.apache.kylin</groupId>
    <artifactId>kylin-jdbc</artifactId>
    <version>3.1.0</version>
</dependency>
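Step 2: Export the Kylin dependency JARs to a local directory
Export the JARs to a local directory: mvn dependency:copy-dependencies -DoutputDirectory=C:\\my\\out
Step 3: Copy the exported Kylin dependency JARs into the DataX plugin directory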
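Step 3 can be scripted. The following is only a minimal sketch, not part of DataX itself: the function name install_kylin_driver is hypothetical, and it assumes the layout described in the DataX rdbmsreader documentation, where the plugin directory contains a libs folder for driver JARs and a plugin.json file with a "drivers" list. The driver class name org.apache.kylin.jdbc.Driver is the one given in the Kylin JDBC documentation; verify both assumptions against your own installation.

```python
import json
import shutil
from pathlib import Path

# Driver class name taken from the Kylin JDBC documentation (verify for your version).
KYLIN_DRIVER = "org.apache.kylin.jdbc.Driver"


def install_kylin_driver(jar_dir, plugin_dir):
    """Copy exported JARs into <plugin_dir>/libs and register the driver.

    Assumes <plugin_dir>/plugin.json exists and may hold a "drivers" list.
    Returns the names of the JAR files that were copied.
    """
    jar_dir, plugin_dir = Path(jar_dir), Path(plugin_dir)
    libs = plugin_dir / "libs"
    libs.mkdir(parents=True, exist_ok=True)

    copied = []
    for jar in sorted(jar_dir.glob("*.jar")):
        shutil.copy2(jar, libs / jar.name)
        copied.append(jar.name)

    # Add the driver class to plugin.json only if it is not already listed.
    plugin_json = plugin_dir / "plugin.json"
    config = json.loads(plugin_json.read_text(encoding="utf-8"))
    drivers = config.setdefault("drivers", [])
    if KYLIN_DRIVER not in drivers:
        drivers.append(KYLIN_DRIVER)
    plugin_json.write_text(json.dumps(config, indent=4), encoding="utf-8")
    return copied
```

A call might look like install_kylin_driver(r"C:\my\out", r"C:\datax\plugin\reader\rdbmsreader"), where the second path is a placeholder for your own DataX installation. Back up plugin.json first, since the sketch rewrites the file.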
Step 4: Write the job JSON
{
    "job": {
        "setting": {
            "speed": {
                "channel": 5,
                "byte": 1048576
            },
            "errorLimit": {
                "records": 1,
                "percentage": 0
            }
        },
        "content": [
            {
                "reader": {
                    "name": "rdbmsreader",
                    "parameter": {
                        "username": "uname",
                        "password": "pwd",
                        "connection": [
                            {
                                "querySql": [
                                    "select * from table"
                                ],
                                "jdbcUrl": [
                                    "jdbc:kylin://ip:7070/ddp_pro_BIReport"
                                ]
                            }
                        ],
                        "maxRetries": 3,
                        "splitPk": "id"
                    }
                },
                "writer": {
                    "name": "hbase11xwriter",
                    "parameter": {
                        "hbaseConfig": {
                            "hbase.zookeeper.quorum": ""
                        },
                        "table": "datamining:prediction_source_data",
                        "mode": "normal",
                        "rowkeyColumn": [
                            {
                                "index": 1,
                                "type": "string"
                            },
                            {
                                "index": -1,
                                "type": "string",
                                "value": "#"
                            },
                            {
                                "index": 0,
                                "type": "string"
                            }
                        ],
                        "column": [
                            {
                                "index": 2,
                                "name": "prediction:tc",
                                "type": "string"
                            }
                        ],
                        "versionColumn": {
                            "index": -1,
                            "value": "123456789"
                        },
                        "encoding": "utf-8"
                    }
                }
            }
        ]
    }
}
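Before handing the job file to DataX, it can help to confirm that it is valid JSON, since a string literal that spans several lines (easy to produce when pasting SQL into querySql) is rejected by strict JSON parsers. The following is a minimal sketch; summarize_job is a hypothetical helper, not a DataX API, and it only assumes the job/content/reader/writer structure shown in the job above.

```python
import json


def summarize_job(job_text):
    """Parse a DataX job file and list reader/writer names and JDBC URLs.

    Raises json.JSONDecodeError if the text is not valid JSON.
    """
    job = json.loads(job_text)
    summary = []
    for item in job["job"]["content"]:
        reader, writer = item["reader"], item["writer"]
        # Collect every jdbcUrl declared under the reader's connections.
        urls = [
            url
            for conn in reader["parameter"].get("connection", [])
            for url in conn.get("jdbcUrl", [])
        ]
        summary.append(
            {"reader": reader["name"], "writer": writer["name"], "jdbcUrl": urls}
        )
    return summary
```

Once the file parses cleanly, the job is normally started with the launcher script that ships with DataX, for example python bin/datax.py job.json from the DataX home directory (job.json being whatever name you saved the file under).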
Done!
Reference: https://blog.csdn.net/supingemail/article/details/104420813