What is oozie equivalent for Spark?

549 Views Asked by Aravind Yarram At 24 November 2015 at 00:55

We have very complex pipelines which we need to compose and schedule. I see that Hadoop ecosystem has Oozie for this. What are the choices for Spark based jobs when I am running Spark on Mesos or Standalone and doesn't have a Hadoop cluster?

Original Q&A

There are 2 best solutions below

srinath_perera On 26 November 2015 at 04:08 BEST ANSWER

Unlike with Hadoop, it is pretty easy to chains things with Spark. So writing a Spark Scala script might be enough. My first recommendation is tying that.

If you like to keep it SQL like, you can try SparkSQL.

If you have a really complex flow, it is worth looking at Google data flow https://github.com/GoogleCloudPlatform/DataflowJavaSDK.

Rakesh On 25 November 2015 at 12:58

Oozie can be used in case of Yarn, for spark there is no built in scheduler available, So you are free to choose any scheduler which works in the cluster mode.

For Mesos I feel Chronos would be the right choice, more info on Chronos

What is oozie equivalent for Spark?

There are 2 best solutions below

Related Questions in HADOOP

Related Questions in APACHE-SPARK

Related Questions in BIGDATA

Related Questions in APACHE-SPARK-1.5

Trending Questions

Popular # Hahtags

Popular Questions