AWS DataPipeline EMR cluster with spark

621 Views Asked by user1846749 At 05 September 2017 at 05:01

I have created an AWS DataPipeline using EMR template, but its not installing Spark on EMR cluster. Do I need to set any special action for that ? I see some bootstrapaction is need for spark installation but that is also not working.

Original Q&A

There are 1 best solutions below

Spark-Beginner On 04 December 2017 at 09:37

That install-spark bootstrap action is only for 3.x AMI versions. If you are using a releaseLabel (emr-4.x or beyond), the applications to install are specified in a different way.

When you are creating a pipeline, you click "Edit in Architect" at the bottom or edit your pipeline on pipelines home page then you can then click on the EmrCluster node and select Applications from the "Add an optional field..." dropdown. That is where you may add Spark.

AWS DataPipeline EMR cluster with spark

There are 1 best solutions below

Related Questions in APACHE-SPARK

Related Questions in EMR

Related Questions in AMAZON-DATA-PIPELINE

Trending Questions

Popular # Hahtags

Popular Questions