Install Hadoop 2.6.0 On Ubuntu14.04
8 min readJun 21, 2017
### 1.下載Hadoop: 到Hadoop官網下載,或是在terminal使用wget。(Hadoop下載連結:http://apache.stu.edu.tw/hadoop/common/hadoop-2.6.0/hadoop-2.6.0.tar.gz ) ####2.解壓縮、般位置、設置捷徑:
$ tar -zxf hadoop-2.6.0.tar.gz
$ sudo mv hadoop-2.6.0/ /usr/local/
$ cd /usr/local
$ sudo ln -s hadoop-2.6.0/ hadoop
####3.給hadoop一個專用的使用者
$ sudo addgroup hadoop
$ sudo adduser --ingroup hadoop hduser
$ sudo adduser hduser sudo
$ sudo chown -R hduser:hadoop /usr/local/hadoop/
####4.設置ssh免密碼登入
$ su hduser
$ sudo apt-get install ssh
$ ssh-keygen -t rsa -P ""
$ cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
$ chmod 600 ~/.ssh/authorized_keys && chmod 700 ~/.ssh
####5.調整/etc/sysctl.conf檔案設置 在文件最底下加入:
net.ipv6.conf.all.disable_ipv6 = 1
net.ipv6.conf.default.disable_ipv6 = 1
net.ipv6.conf.lo.disable_ipv6 = 1
調整完後: $ sudo service networking restart
或是$ sudo shutdown -r now
####6.測試一下
$ su hduser
$ ssh localhost
‘Are you sure you want to continue connecting?’(yes/no)yes
####7.更新並安裝Java
$ sudo apt-get update
$ sudo apt-get install default-jdk
如果已經有安裝的話可以使用$ which java | sed -e 's/\(.*\)\/bin\/java/\1/g'
找到你的JAVA_HOME路徑 切換使用者$ su hduser
並修改bash檔案$ sudo vi ~/.bashrc
在文件底部加入export路徑
export HADOOP_HOME=/usr/local/hadoop
export JAVA_HOME=/usr
重新sorce一次$ source ~/.bashrc
####8.調整Hadoop各項設定檔案
- 設置HDFS位置
$ su hduser
$ mkdir /usr/local/hadoop/data
- 修改/usr/local/hadoop/etc/hadoop/hadoop-env.sh
(1)
JAVA_HOME=${JAVA_HOME} 修改成 JAVA_HOME=/usr
(2)
export HADOOP_OPTS="$HADOOP_OPTS -Djava.net.preferIPv4Stack=true
修改成
export HADOOP_OPTS="$HADOOP_OPTS -Djava.net.preferIPv4Stack=true -Djava.library.path=$HADOOP_PREFIX/lib"
(3)文件最後加上
export HADOOP_COMMON_LIB_NATIVE_DIR=${HADOOP_PREFIX}/lib/native
- 修改/usr/local/hadoop/etc/hadoop/yarn-env.sh 底部加上
export HADOOP_CONF_LIB_NATIVE_DIR=${HADOOP_PREFIX:-"/lib/native"}
export HADOOP_OPTS="-Djava.library.path=$HADOOP_PREFIX/lib"
- 修改/usr/local/hadoop/etc/hadoop/core-site.xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:9000</value>
</property>
<property>
<name>hadoop.tmp.dir</name>
<value>/usr/local/hadoop/data</value>
</property>
</configuration>
- 修改/usr/local/hadoop/etc/hadoop/mapred-site.xml
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
</configuration>
- 修改/usr/local/hadoop/etc/hadoop/hdfs-site.xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
<property>
<name>dfs.replication</name>
<value>3</value>
</property>
</configuration>
- 修改/usr/local/hadoop/etc/hadoop/yarn-site.xml
<?xml version="1.0"?>
<configuration>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.aux-services.mapreduce_shuffle.class</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
<property>
<name>yarn.resourcemanager.resource-tracker.address</name>
<value>localhost:8025</value>
</property>
<property>
<name>yarn.resourcemanager.scheduler.address</name>
<value>localhost:8030</value>
</property>
<property>
<name>yarn.resourcemanager.address</name>
<value>localhost:8050</value>
</property>
</configuration>
####9.Namenode格式化並啟動Hadoop
$ /usr/local/hadoop/bin/hadoop namenode -format
$ /usr/local/hadoop/sbin/start-dfs.sh
$ /usr/local/hadoop/sbin/start-yarn.sh
####10.完成並檢查 $ jps
會出現
在瀏覽器中輸入http://localhost:50070
會出現