mapReduce 输出结果导入Oracle,无效字符错误

mapReduce 输出结果导入Oracle,无效字符错误

使用 Map 读取数据,然后输出到 Oracle。相信下面这段代码大家都不陌生,但运行时一直报错:无效字符(ORA-00911)。

 // Driver setup: map-only job whose output records are written to Oracle
 // via DBOutputFormat. (Fixed from the original snippet: configureDB(...)
 // was missing its ';' and the setOutput(...) varargs call had an illegal
 // trailing comma.)
 Job job = new Job(conf, "Query_Job");

 job.setJarByClass(ImportDriver.class);
 job.setMapperClass(ImportMapper.class);
 job.setOutputKeyClass(ActiveIpD.class);
 job.setOutputValueClass(Text.class);
 FileInputFormat.addInputPath(job, new Path(input));

 // Write results to Oracle instead of HDFS.
 job.setOutputFormatClass(DBOutputFormat.class);
 DBConfiguration.configureDB(xxx, xxx, xxx, xxx); // driver class, JDBC URL, user, password
 DBOutputFormat.setOutput(job, "tableName", "id", "name");
 job.setNumReduceTasks(0); // map-only: mappers write directly to the DB

后来观察 DBOutputFormat 的源码发现:构造 SQL 结束后,在语句末尾加了一个分号,而 Oracle 的 JDBC 驱动不能识别分号,从而导致该错误。

 // Quoted from Hadoop's DBOutputFormat: builds the parameterized INSERT
 // statement used by the record writer. Note the final append of ");" —
 // the trailing semicolon is what Oracle's JDBC driver rejects
 // (ORA-00911: invalid character).
 public String constructQuery(String table, String[] fieldNames) {
    if(fieldNames == null) {
      throw new IllegalArgumentException("Field names may not be null");
    }

    StringBuilder query = new StringBuilder();
    query.append("INSERT INTO ").append(table);

    // Column list is only emitted when explicit field names were supplied.
    if (fieldNames.length > 0 && fieldNames[0] != null) {
      query.append(" (");
      for (int i = 0; i < fieldNames.length; i++) {
        query.append(fieldNames[i]);
        if (i != fieldNames.length - 1) {
          query.append(",");
        }
      }
      query.append(")");
    }
    query.append(" VALUES (");

    // One '?' placeholder per field, bound later by the PreparedStatement.
    for (int i = 0; i < fieldNames.length; i++) {
      query.append("?");
      if(i != fieldNames.length - 1) {
        query.append(",");
      }
    }
    query.append(");"); // BUG: trailing ';' — Oracle cannot parse it

    return query.toString();
  }

重写 constructQuery 方法,去掉末尾的分号:

package com.boco.querymr.util;

import org.apache.commons.logging.Log;
import org.apache.commons.logging.LogFactory;
import org.apache.hadoop.mapreduce.*;
import org.apache.hadoop.mapreduce.lib.db.DBConfiguration;
import org.apache.hadoop.mapreduce.lib.db.DBOutputFormat;
import org.apache.hadoop.mapreduce.lib.db.DBWritable;
import org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.util.StringUtils;

import java.io.IOException;
import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.SQLException;


/**
 * A {@link DBOutputFormat} variant for Oracle.
 *
 * <p>The stock Hadoop implementation terminates the generated INSERT
 * statement with a semicolon, which Oracle's JDBC driver rejects with
 * "ORA-00911: invalid character". This subclass overrides
 * {@link #constructQuery(String, String[])} to emit the same statement
 * without the trailing semicolon.
 */
public class MyDBOutputFormat<K extends DBWritable, V> extends DBOutputFormat<K, V> {
    private static final Log LOG = LogFactory.getLog(MyDBOutputFormat.class);

    /**
     * Builds the parameterized INSERT statement executed for each record.
     *
     * @param table      target table name
     * @param fieldNames column names; a single {@code null} element means
     *                   "no explicit column list" (Hadoop convention)
     * @return an INSERT statement with one {@code ?} placeholder per field,
     *         with no trailing semicolon (Oracle-compatible)
     * @throws IllegalArgumentException if {@code fieldNames} is null
     */
    @Override
    public String constructQuery(String table, String[] fieldNames) {
        if (fieldNames == null) {
            throw new IllegalArgumentException("Field names may not be null");
        }

        StringBuilder query = new StringBuilder();
        query.append("INSERT INTO ").append(table);

        // Emit the column list only when explicit field names were supplied.
        if (fieldNames.length > 0 && fieldNames[0] != null) {
            query.append(" (");
            for (int i = 0; i < fieldNames.length; i++) {
                if (i > 0) {
                    query.append(",");
                }
                query.append(fieldNames[i]);
            }
            query.append(")");
        }

        query.append(" VALUES (");
        for (int i = 0; i < fieldNames.length; i++) {
            if (i > 0) {
                query.append(",");
            }
            query.append("?");
        }
        // Deliberately no ';' here — that is the whole point of this subclass.
        query.append(")");

        LOG.info(query.toString()); // removed duplicate System.err debug output
        return query.toString();
    }

    /**
     * Configures {@code job} to write its output to {@code tableName} using
     * this output format. Speculative reducers are disabled so a row is not
     * inserted twice by duplicate task attempts.
     *
     * <p>NOTE(review): unlike {@code DBOutputFormat.setOutput}, this helper
     * does not set the output field names/count on the returned
     * {@link DBConfiguration}; the caller appears responsible for that —
     * confirm before relying on it.
     *
     * @param job       the job to configure
     * @param tableName target Oracle table
     * @return the job's {@link DBConfiguration} with the table name set
     * @throws IOException declared for parity with the parent API
     */
    private static DBConfiguration setOutput(Job job,
                                             String tableName) throws IOException {
        job.setOutputFormatClass(MyDBOutputFormat.class);
        job.setReduceSpeculativeExecution(false);

        DBConfiguration dbConf = new DBConfiguration(job.getConfiguration());

        dbConf.setOutputTableName(tableName);
        return dbConf;
    }
}
發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章