共计 6799 个字符,预计需要花费 17 分钟才能阅读完成。
1. 基本环境:
操作系统:
CentOS 7.2.1511
jdk 环境
版本:jdk-8u45-linux-x64.rpm
mysql 环境:
rpm 包:http://ftp.ntu.edu.tw/MySQL/Downloads/MySQL-5.6/MySQL-5.6.33-1.linux_glibc2.5.x86_64.rpm-bundle.tar
jdbc 链接包:http://101.96.10.72/cdn.mysql.com//Downloads/Connector-J/mysql-connector-Java-5.1.40.tar.gz
CDH 安装相关的包:
cloudera manager 包:http://archive.cloudera.com/cm5/cm/5/cloudera-manager-centos7-cm5.8.3_x86_64.tar.gz
CDH 包:http://archive.cloudera.com/cdh5/parcels/5.8.3/CDH-5.8.3-1.cdh5.8.3.p0.2-el7.parcel.sha1
http://archive.cloudera.com/cdh5/parcels/5.8.3/CDH-5.8.3-1.cdh5.8.3.p0.2-el7.parcel
http://archive.cloudera.com/cdh5/parcels/5.8.3/manifest.json
集群规划
IP 地址 主机名说明
192.168.50.123Hadoop1 主节点 master,datanode
192.168.50.124hadoop2datanode
192.168.50.125hadoop3 datanode
开始安装前配置
1. 安装 jdk(每个机器都要装)
安装前要先卸载掉原有的 jdk 版本,避免造成冲突
2. 修改三个机器上面的 hosts
192.168.50.123 hadoop1
192.168.50.124 hadoop2
192.168.50.125 hadoop3
3. 同步时间
ntpdate -s pool.ntp.org
4. 关闭防火墙和 selinux
sed -i ‘s/SELINUX=.*/SELINUX=disabled/’ /etc/selinux/config #重启机器
systemctl stop firewalld
systemctl disable firewalld
5. 配置 ssh 无密码登陆
[root@localhost ~]# ssh-keygen -t rsa -P ”
Generating public/private rsa key pair.
Enter file in which to save the key (/root/.ssh/id_rsa):
Your identification has been saved in /root/.ssh/id_rsa.
Your public key has been saved in /root/.ssh/id_rsa.pub.
The key fingerprint is:
62:b0:4c:aa:e5:37:92:89:4d:db:c3:38:e2:f1:2a:d6 root@admin-node
The key’s randomart image is:
+–[RSA 2048]—-+
| |
| |
| o |
| + o |
| + o o S |
| B B . . |
|+.@ * |
|oooE o |
|oo.. |
+—————–+
ssh-copy-id hadoop1
ssh-copy-id hadoop2
ssh-copy-id hadoop3
6. 安装 mysql
[root@hadoop1]#tar -xvf MySQL-5.6.33-1.linux_glibc2.5.x86_64.rpm-bundle.tar
[root@hadoop1]#rpm -ivh MySQL-*.rpm
修改配置文件路径:cp /usr/share/mysql/my-default.cnf /etc/my.cnf
# 配置 mysql
[root@hadoop1]#vim /etc/my.cnf
[mysqld]
default-storage-engine = innodb
innodb_file_per_table
collation-server = utf8_general_ci
init-connect = ‘SET NAMES utf8’
character-set-server = utf8
# 初始化数据库
/usr/local/mysql/scripts/mysql_install_db –basedir=/usr/local/mysql/ –datadir=/data/mysql/ –user=mysql >>/dev/null
# 启动 mysql
service mysqld start
chkconfig mysqld on
– 查看 mysql root 初始化密码
[root@hadoop1]# cat /root/.mysql_secret
# The random password set for the root user at Fri Sep 16 11:13:25 2016 (local time): 9mp7uYFmgt6drdq3
– 登录进行去更改密码
[root@hadoop1]# mysql -u root -p
mysql> SET PASSWORD=PASSWORD(‘123456’);
– 允许 mysql 远程访问
mysql> grant all on *.* to root@”%” Identified by “www.123”;
Query OK, 1 row affected (0.05 sec)
mysql> flush privileges;
Query OK, 0 rows affected (0.00 sec)
创建 cdh 所需要的库
create database hive DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
Query OK, 1 row affected (0.00 sec)
create database amon DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
Query OK, 1 row affected (0.00 sec)
create database hue DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
Query OK, 1 row affected (0.00 sec)
create database monitor DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
Query OK, 1 row affected (0.00 sec)
create database oozie DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
Query OK, 1 row affected (0.00 sec)
7. 第三方依赖包安装(所有节点都安装)
1 yum install chkconfig Python bind-utils psmisc libxslt zlib sqlite fuse fuse-libs RedHat-lsb cyrus-sasl-plain cyrus-sasl-gssapi
注意这个地方依赖包不安装完下面启动集群的时候会死活启动不了的,这是血的教训啊!
在 hadoop1 上准备 mysql 的 jar 包
[root@hadoop1]# mkdir -p /usr/share/java
修改 jar 包的名字,并拷贝到 /usr/share/java/ 目录
[root@hadoop1]# cp mysql-connector-java-5.1.40-bin.jar /usr/share/java/mysql-connector-java.jar
8. 安装 Cloudera-Manager
解压 cm 包到指定目录,所有服务器都要做
[root@hadoop1 ~]#mkdir /opt/cloudera-manager
[root@hadoop1 ~]# tar -axvf cloudera-manager-centos7-cm5.8.3_x86_64.tar.gz -C /opt/cloudera-manager
创建 cloudera-scm 用户(所有节点)
[root@hadoop1 ~]# useradd -r -d /opt/cloudera-manager/cm-5.8.3/run/cloudera-scm-server -M -c “Cloudera SCM User” cloudera-scm
在 hadoop2 和 hadoop3 配置 agent
vim /opt/cloudera-manager/cm-5.8.3/etc/cloudera-scm-agent/config.ini
将 server_host 改为 CMS 所在的主机名即 hadoop1
主节点中创建 parcel-repo 仓库
[root@hadoop1 ~]# chown cloudera-scm:cloudera-scm /opt/cloudera/parcel-repo
[root@hadoop1 ~]# mv CDH-5.8.3-1.cdh5.8.3.p0.2-el7.parcel.sha1 CDH-5.8.3-1.cdh5.8.3.p0.2-el7.parcel.sha
[root@hadoop1 ~]# cp CDH-5.8.3-1.cdh5.8.3.p0.2-el7.parcel CDH-5.8.3-1.cdh5.8.3.p0.2-el7.parcel.sha manifest.json /opt/cloudera/parcel-repo
解释:Clouder-Manager 将 CDHs 从主节点的 /opt/cloudera/parcel-repo 目录中抽取出来,分发解压激活到各个节点的 /opt/cloudera/parcels 目录中
初始脚本配置数据库 scm_prepare_database.sh(在主节点上)
[root@hadoop1 ~]# /opt/cloudera-manager/cm-5.8.3/share/cmf/schema/scm_prepare_database.sh mysql -h hadoop1 -P 3306 -uroot -pwww.123 –scm-host master scm scm scm
JAVA_HOME=/usr/java/jdk1.8.0_45
Verifying that we can write to /opt/cloudera-manager/cm-5.8.3/etc/cloudera-scm-server
Creating SCM configuration file in /opt/cloudera-manager/cm-5.8.3/etc/cloudera-scm-server
Executing: /usr/java/jdk1.8.0_45/bin/java -cp /usr/share/java/mysql-connector-java.jar:/usr/share/java/Oracle-connector-java.jar:/opt/cloudera-manager/cm-5.8.3/share/cmf/schema/../lib/* com.cloudera.enterprise.dbutil.DbCommandExecutor /opt/cloudera-manager/cm-5.8.3/etc/cloudera-scm-server/db.properties com.cloudera.cmf.db.
[main] DbCommandExecutor INFO Successfully connected to database.
All done, your SCM database is configured correctly!
说明:这个脚本就是用来创建和配置 CMS 需要的数据库的脚本。各参数是指:
mysql:数据库用的是 mysql,如果安装过程中用的 oracle,那么该参数就应该改为 oracle。
-hhadoop1:数据库建立在 hadoop1 主机上面。也就是主节点上面。
-uroot:root 身份运行 mysql。-123456:mysql 的 root 密码是 ***。
–scm-host hadoop1:CMS 的主机,一般是和 mysql 安装的主机是在同一个主机上。
最后三个参数是:数据库名,数据库用户名,数据库密码。
10. 在各个节点启动 agent 服务
/opt/cloudera-manager/cm-5.8.3/etc/init.d/cloudera-scm-agent start
在 master 启动 server 服务
/opt/cloudera-manager/cm-5.8.3/etc/init.d/cloudera-scm-server start
浏览器访问
http://192.168.50.123:7180/cmf/login 用户名 admin 密码 admin
问题 1:
service cloudera-scm-server status
cloudera-scm-server dead but pid file exists
解决
[root@master cm-5.8.3]# rm /root/hadoop/cm-5.8.3/run/cloudera-scm-server.pid
[root@master hadoop]# ./cm-5.8.3/etc/init.d/cloudera-scm-server restart
cloudera-scm-server is already stopped
Starting cloudera-scm-server: [OK]
问题 2:
2016-12-08 03:40:57,479 ERROR WebServerImpl:com.cloudera.server.web.cmf.search.components.SearchRepositoryManager: The server storage directory [/var/lib/cloudera-scm-server] doesn’t exist.
2016-12-08 03:40:57,479 ERROR WebServerImpl:com.cloudera.server.web.cmf.search.components.SearchRepositoryManager: No read permission to the server storage directory [/var/lib/cloudera-scm-server]
2016-12-08 03:40:57,479 ERROR WebServerImpl:com.cloudera.server.web.cmf.search.components.SearchRepositoryManager: No write permission to the server storage directory [/var/lib/cloudera-scm-server]
解决:
创建目录并加上权限以后成功
mkdir /var/lib/cloudera-scm-server
chown -R cloudera-scm.cloudera-scm /var/lib/cloudera-scm-server
问题 3:在 CDH 检查主机哪里会有两个警告
解决:
echo never > /sys/kernel/mm/transparent_hugepage/defrag
echo 10 > /proc/sys/vm/swappiness
CDH 的安装和设置 http://www.linuxidc.com/Linux/2017-02/140707.htm
yum 安装 CDH5.5 Hadoop 集群 http://www.linuxidc.com/Linux/2017-02/140186.htm
CDH5.9.0 集群部署与搭建 http://www.linuxidc.com/Linux/2017-01/139615.htm
CDH5.7.2 离线部署笔记 http://www.linuxidc.com/Linux/2016-08/133924.htm
Cloudera Manager 5 和 CDH5 离线安装 http://www.linuxidc.com/Linux/2016-07/133360.htm
本文永久更新链接地址 :http://www.linuxidc.com/Linux/2017-03/141294.htm