hadoop 設置了reduce但是無法執行的bug

原創

lsxy117

2020-02-22 07:40

今天做mapreduce開發的時候，遇到個詭異的問題，設置了reduce方法，但是就是沒有執行。

爲了進一步驗證reduce是否執行，特地在reduce方法裏添加了一些提示信息的輸出，查看後臺task日誌文件裏面確實沒有對應的打印內容，說明reduce沒有執行。

hadoop版本：1.2.1

開發工具：eclipse

設計的Reducer方法如下：

public static class KPIBrowserIPReducer extends
Reducer<Text, IntWritable, Text, Text> {
private Text outvalue = new Text();

public void reduce(Text key, Iterator<IntWritable> values,
Context context) throws IOException, InterruptedException {
int sum = 0;
while (values.hasNext()) {
sum = sum + 1;
}
outvalue.set(Integer.toString(sum));
System.err.println("key : " + key + "----outvalue : " + outvalue);
context.write(key, outvalue);
}

}

分析：

很有可能是reduce方法寫錯了。於是在KPIBrowserIPReducer 方法體裏面右擊重新生成reduce方法。

自動生成的reduce方法如下：

protected void reduce(
Text arg0,
java.lang.Iterable<IntWritable> arg1,
org.apache.hadoop.mapreduce.Reducer<Text, IntWritable, Text, Text>.Context arg2)
throws IOException, InterruptedException {
}

對比分析，第二個參數傳入的對象不一樣，一個是Iterator<IntWritable>，而自動生成的是java.lang.Iterable<IntWritable>，問題就出在這個地方，當我在自己寫的reduce方法前加上關鍵字@Override 時，自己寫的方法立馬報錯。Iterator這個是hadoop老版本之前的寫法。

解決方案：使用自動生成的reduce方法，然後在裏面重新寫reduce處理邏輯。

經驗：儘量在重寫方法前加上@Override ，它對代碼的檢測是非常用幫助的。

lsxy117

發佈了103 篇原創文章 · 獲贊 14 · 訪問量 29萬+

私信關注

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

hadoop 設置了reduce但是無法執行的bug

「Pygors跨平臺GUI」1：Pygors跨平臺GUI應用研究

[轉帖]

python列出centos7內存使用前50的進程信息

「Pygors跨平臺GUI」2：安裝MinGW-w64、MSYS2還是WSL2

一鍵自動化博客發佈工具,用過的人都說好(掘金篇)

Garnet：微軟官方基於.NET開源的高性能分佈式緩存存儲數據庫

Flink執行圖

Java響應式編程

評估統計算法在銀行僞造鈔票檢測中的價值

Dokcer部署Kafka集羣

Hive集成Mysql作爲元數據時，提示錯誤：Specified key was too long; max key length is 767 bytes

hadoop1.2.1集羣增加datanode節點

win7使用eclipse連接hadoop集羣，運行mapreduce報錯之Failed to set permissions of path

linux awk命令詳解

reduce裏的一個坑

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結