Cloudera Enterprise 5.16.x

Step 4: Install CDH Packages

  Note: When configuring high availability for HDFS, do not install hadoop-hdfs-secondarynamenode. After enabling HDFS HA, follow the installation instructions under Deploying HDFS High Availability.

Before you install CDH, review Recommended Cluster Hosts and Role Distribution.

Install CDH Packages

  1. Install CDH packages on the appropriate hosts, as follows:
    • RHEL Compatible:
      sudo yum install <package_name> [<package_name>...]
    • SLES:
      sudo zypper install <package_name> [<package_name>...]
    • Ubuntu:
      sudo apt-get install <package_name> [<package_name>...]

    The following table lists the package names for each component.

      Note: Most of the packages below contain only init.d scripts for the associated services, but have dependencies on the core software. Your package manager will automatically install the required dependent packages. For example, installing zookeeper-server will also automatically install zookeeper.
      Role                    Package Name
    HDFS
      NameNode                hadoop-hdfs-namenode
      Secondary NameNode      hadoop-hdfs-secondarynamenode
      DataNode                hadoop-hdfs-datanode
      HttpFS                  hadoop-httpfs
    MapReduce v2 with YARN
      ResourceManager         hadoop-yarn-resourcemanager
      NodeManager             hadoop-yarn-nodemanager
      JobHistory Server       hadoop-mapreduce-historyserver
    Hadoop Clients
      Hadoop Client           hadoop-client
    Flume
      Flume Agent             flume-ng-agent
    HBase
      All HBase Roles         hbase
    Hive
      HiveServer2             hive-server2
      Hive Metastore          hive-metastore
      Hive Client             hive
      Hive HBase Connector    hive-hbase
    HCatalog
      HCatalog                hive-hcatalog
      WebHCat                 hive-webhcat-server
    Impala
      Impala Daemon           impala-server
      StateStore              impala-state-store
      Catalog Server          impala-catalog
      Impala Shell            impala-shell
    Hue
      Hue Server              hue
    Kafka
      Kafka Broker            kafka-server
    Kudu
      Kudu Master             kudu-master
      Tablet Server           kudu-tserver
      Kudu Client             kudu-client0
        Note: For Ubuntu, the Kudu client package name is libkuduclient0.
      Kudu SDK                kudu-client-devel
        Note: For Ubuntu, the Kudu SDK package name is libkuduclient-dev.
    Spark
      Spark Worker            spark-core
      Spark History Server    spark-history-server
      Spark Python Client     spark-python
    KMS
      Java KeyStore KMS       hadoop-kms-server
    Oozie
      Oozie Server            oozie
      Oozie Client            oozie-client
    Search
      Solr Server             solr-server
      Solr MapReduce Tools    solr-mapreduce
      Lily HBase Indexer      hbase-solr-indexer
      Spark Indexer           solr-crunch
    Sentry
      Sentry Server           sentry
    Sqoop
      Sqoop Metastore         sqoop-metastore
      Sqoop Client            sqoop
    Sqoop 2
      Sqoop 2 Server          sqoop2-server
      Sqoop 2 Client          sqoop2-client
    ZooKeeper
      ZooKeeper Server        zookeeper-server
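
As a rough illustration of step 1, the following shell sketch builds the install command for a host from its OS family and a list of package names taken from the table above. The function name cdh_install_cmd and the OS family keywords (rhel, sles, ubuntu) are hypothetical names chosen for this example and are not part of CDH. The function only prints the command, so you can review it before running it yourself.

```shell
#!/bin/sh
# Hypothetical helper (not part of CDH): print the package install command
# for a given OS family and one or more package names from the table above.
cdh_install_cmd() {
  if [ "$#" -lt 2 ]; then
    echo "usage: cdh_install_cmd <rhel|sles|ubuntu> <package_name>..." >&2
    return 1
  fi
  os_family="$1"
  shift
  case "$os_family" in
    rhel)   echo "sudo yum install $*" ;;
    sles)   echo "sudo zypper install $*" ;;
    ubuntu) echo "sudo apt-get install $*" ;;
    *)
      echo "unsupported OS family: $os_family" >&2
      return 1
      ;;
  esac
}

# Example: a RHEL-compatible worker host running a DataNode and a NodeManager.
cdh_install_cmd rhel hadoop-hdfs-datanode hadoop-yarn-nodemanager
```

Which packages go on which host depends on your role layout; the DataNode and NodeManager pairing above is only an example. After installing a service package such as zookeeper-server on a RHEL-compatible host, running rpm -q zookeeper is one way to confirm that the dependent core package was installed as well.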

(Optional) Install LZO

This section explains how to install LZO (Lempel–Ziv–Oberhumer) compression. For more information, see Choosing and Configuring Data Compression.

  1. Add the repository on each host in the cluster. Follow the instructions for your OS version:
    • RHEL Compatible:
      sudo wget -P /etc/yum.repos.d/ https://archive.cloudera.com/gplextras5/redhat/<version>/x86_64/gplextras/cloudera-gplextras5.repo

      Replace <version> with your RHEL version: 7, 6, or 5

    • SLES:
      sudo zypper addrepo -f https://archive.cloudera.com/gplextras5/sles/<version>/x86_64/gplextras/cloudera-gplextras5.repo

      Replace <version> with your SLES version: 12 or 11

    • Ubuntu:
      sudo wget -P /etc/apt/sources.list.d/ https://archive.cloudera.com/gplextras5/ubuntu/<version>/amd64/gplextras/cloudera-gplextras.list

      Replace <version> with your Ubuntu version: xenial, trusty, precise, or lucid

    • Debian:
      sudo wget -P /etc/apt/sources.list.d/ https://archive.cloudera.com/gplextras5/debian/<version>/amd64/gplextras/cloudera.list

      Replace <version> with your Debian version: jessie, squeeze, or wheezy

  2. Install the hadoop-lzo package:
    • RHEL Compatible:
      sudo yum install hadoop-lzo
    • SLES:
      sudo zypper install hadoop-lzo
    • Ubuntu, Debian:
      sudo apt-get install hadoop-lzo
  3. Continue with installing and deploying CDH. As part of the deployment, complete the additional LZO configuration described under Configuring LZO.
      Important: Be sure to do this configuration after you have copied the default configuration files to a custom location and set alternatives to point to it.
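
The repository URLs in step 1 all follow the same pattern, so the sketch below shows one way to derive the URL for a host from its OS and version. The function name gplextras_repo_url and the OS keywords it accepts (redhat, sles, ubuntu, debian, matching the path segments in the URLs above) are hypothetical choices for this example. The function does not check that the version you pass is one of the supported values listed in step 1.

```shell
#!/bin/sh
# Hypothetical helper (not part of CDH): print the GPL Extras repository
# file URL for a given OS and version, following the patterns in step 1.
gplextras_repo_url() {
  os="$1"
  version="$2"
  base="https://archive.cloudera.com/gplextras5"
  if [ -z "$os" ] || [ -z "$version" ]; then
    echo "usage: gplextras_repo_url <redhat|sles|ubuntu|debian> <version>" >&2
    return 1
  fi
  case "$os" in
    redhat) echo "$base/redhat/$version/x86_64/gplextras/cloudera-gplextras5.repo" ;;
    sles)   echo "$base/sles/$version/x86_64/gplextras/cloudera-gplextras5.repo" ;;
    ubuntu) echo "$base/ubuntu/$version/amd64/gplextras/cloudera-gplextras.list" ;;
    debian) echo "$base/debian/$version/amd64/gplextras/cloudera.list" ;;
    *)
      echo "unsupported OS: $os" >&2
      return 1
      ;;
  esac
}

# Example: print the repository file URL for a RHEL 7 compatible host.
gplextras_repo_url redhat 7
```

Printing the URL first makes it easy to check the value substituted for <version> before you add the repository on every host in the cluster.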

Page generated October 24, 2018.