Hadoop 2.x 僞分佈式環境搭建
$ sudo chown ijeffrey:ijeffrey -R /opt/
$ mkdir modules software
$ tar -zxvf hadoop-2.7.3.tar.gz -C /opt/modules/apache/
$ tar -zxf jdk-7u67-linux-x64.tar.gz -C /opt/modules
jdk:rpm -qa | grep jdk
$ sudo rpm -e --nodeps java-1.6.0-openjdk-1.6.0.0-1.50.1.11.5.el6_3.x86_64
$ sudo rpm -e --nodeps java-1.7.0-openjdk-1.7.0.9-2.3.4.1.el6_3.x86_64
$ sudo vim /etc/profile
$ cd /opt/modules/apache/hadoop2.7.3
$ rm -rf doc
進入到/opt/modules/apache/hadoop2.7.3/etc/hadoop裏面配置三個env文件:
<!--NameNode地址,8020是指定進程8020,訪問入口 -->
<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:8020</value>
</property>
<!--hadoop在運行時產生的文件,元數據在本地的存放目錄 -->
<property>
<name>hadoop.tmp.dir</name>
<value>/opt/modules/apache/hadoop-2.7.3/data/tmp</value>
</property>
<configuration>
<!--存放到hdfs上的文件的副本數,僞分佈式配置爲1 -->
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
<!-- 配置 secondarynamenode -->
<property>
<name>dfs.namenode.secondary.http-address</name>
<value>hadoop01.com:50090</value>
</property>
</configuration>
$ bin/hdfs namenode -format
$ sbin/hadoop-daemon.sh start namenode
$ sbin/hadoop-daemon.sh start datanode
<!-- 啓動secondarynamenode -->
$ sbin/hadoop-daemon.sh start secondarynamenode
--Secondary NameNode 用來監控HDFS狀態的輔助後臺程序,每隔一段時間獲取HDFS元數據的快照
【常見問題】
刪除PID
$ rm -rf /tmp/xxx.pid
hadoop tar包和jdk tar 包下載地址:
https://download.csdn.net/download/u014414323/10420850
https://download.csdn.net/download/u014414323/10420846