How do I save H2O Sparkling Water models to disk

142 Views Asked by Mahdi At 19 January 2023 at 07:56

I have a PySpark code to train an H2o DRF model. I need to save this model to disk and then load it.

from pysparkling.ml import H2ODRF
drf = H2ODRF(featuresCols = predictors,
                labelCol = response,
                columnsToCategorical = [response])

I can not find any document on this so I am asking this question here.

Original Q&A

There are 2 best solutions below

TheFon On 01 February 2023 at 03:56

I think the section of the docs on deploying pipeline models might be relevant: https://docs.h2o.ai/sparkling-water/2.3/latest-stable/doc/deployment/pysparkling_pipeline.html

Pipelines may not be what you're looking for depending on the use case.

Something like the following might work for your use case.

drf = H2ODRF(featuresCols = predictors,
                labelCol = response,
                columnsToCategorical = [response])

pipeline = Pipeline(stages=[drf])

model = pipeline.fit(data)
model.save("drf_model")

omoshiroiii On 27 April 2023 at 17:06

model.save("mySavePath")

and then later when you need to load the model:

model = pysparkling.ml.H2OMOJOModel.load("mySavePath")

How do I save H2O Sparkling Water models to disk

There are 2 best solutions below

Related Questions in PYSPARK

Related Questions in H2O

Related Questions in SPARKLING-WATER

Trending Questions

Popular # Hahtags

Popular Questions