Cloudera Enterprise 5.16.x | Other versions

Setting Up Apache Spark Using the Command Line

Spark is a fast, general engine for large-scale data processing.

See also the Apache Spark Documentation.

  Note:

If you uploaded the Spark JAR file as described under Optimizing YARN Mode in Unmanaged CDH Deployments, use the same instructions to upload the new version of the file each time you upgrade to a new minor release of CDH (for example, any CDH 5.4.x release, including 5.4.0).

Page generated October 24, 2018.