Hadoop Study Notes 3: Run Modes, Local Mode

Original

编程小透明

2020-07-07 23:10

Hadoop Run Modes

Local Mode

By default, Hadoop is configured to run in a non-distributed mode, as a single Java process. This is useful for debugging.

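In this mode no daemons are started, and jobs read and write the local file system rather than HDFS, so no extra configuration is needed.
- Official Grep example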
  
  The following example copies the unpacked conf directory to use as input and then finds and displays every match of the given regular expression. Output is written to the given output directory.
  
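  Here the XML files under etc/hadoop serve as the input. The job finds every string that matches the regular expression dfs[a-z.]+ and writes the results to the output directory, which must not exist beforehand.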
```
$ mkdir input
$ cp etc/hadoop/*.xml input
$ bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-2.7.2.jar grep input output 'dfs[a-z.]+'
$ cat output/*
```
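  In practice:
  - Prepare the input
  - Run the bundled grep example
  - Check the output. Do not create the output directory by hand; the job creates it when it runs. If the directory already exists, the job fails with org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory file:/opt/module/hadoop-2.7.2/output already exists.
    
    The presence of a _SUCCESS file in the output directory means the job completed successfully.
- Official WordCount example (counts how many times each word occurs)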
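
The contract described above (the output directory must not exist beforehand, results land in a part file, and a _SUCCESS marker signals completion) can be sketched without Hadoop at all. The following is only a toy stand-in built from standard Unix tools, not how Hadoop implements the job. The function name `local_grep_job` is hypothetical, and the "count, tab, match" line format with the most frequent match first is an assumption about what the real example prints.

```shell
#!/usr/bin/env bash
# Toy stand-in for the Grep example: NOT Hadoop, just standard Unix tools.
# local_grep_job is a hypothetical helper name used only for illustration.

local_grep_job() {
  local input_dir="$1" output_dir="$2" pattern="$3"

  # Like the real job, refuse to run when the output directory already exists.
  if [ -e "$output_dir" ]; then
    echo "Output directory $output_dir already exists" >&2
    return 1
  fi
  mkdir -p "$output_dir"

  # Extract every match of the pattern, count duplicates, and write
  # "<count><TAB><match>" lines, most frequent match first (assumed format).
  grep -ohE "$pattern" "$input_dir"/* \
    | LC_ALL=C sort | uniq -c | LC_ALL=C sort -k1,1nr -k2,2 \
    | awk '{ print $1 "\t" $2 }' > "$output_dir/part-r-00000"

  # Empty marker file, mirroring the _SUCCESS file a finished job leaves behind.
  : > "$output_dir/_SUCCESS"
}
```

Calling `local_grep_job input output 'dfs[a-z.]+'` twice in a row fails the second time, which mirrors the FileAlreadyExistsException above. Removing the stale directory (for example with `rm -r output`) before rerunning is the usual fix for both the sketch and the real job.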
  
  In practice:
  - Prepare the input
```
[root@localhost hadoop-2.7.2]# mkdir wcinput
[root@localhost hadoop-2.7.2]# cd wcinput/
[root@localhost wcinput]# touch wc.input
[root@localhost wcinput]# vim wc.input 
[root@localhost wcinput]# cat wc.input 
Baidu Alibaba
ByteDance
zhangsan
lisi
wangwu wangwu
Bcxtm
Bcxtm
Bcxtm
```
  - Run the bundled wordcount example
```
[root@localhost hadoop-2.7.2]# bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-2.7.2.jar wordcount wcinput/ wcoutput
```
  - Check the output
```
[root@localhost hadoop-2.7.2]# cd wcoutput/
[root@localhost wcoutput]# ll
total 4
-rw-r--r-- 1 root root 65 Jul  5 10:40 part-r-00000
-rw-r--r-- 1 root root  0 Jul  5 10:40 _SUCCESS
[root@localhost wcoutput]# cat part-r-00000 
Alibaba	1
Baidu	1
Bcxtm	3
ByteDance	1
lisi	1
wangwu	2
zhangsan	1
```
