I created a spark DataFrame in a Python paragraph in Zeppelin.
sqlCtx = SQLContext(sc)
spDf = sqlCtx.createDataFrame(df)
and df is a pandas dataframe
print(type(df))
<class 'pandas.core.frame.DataFrame'>
what I want to do is moving spDf from one Python paragraph to another Scala paragraph. It look a reasonable way to do is using z.put.
z.put("spDf", spDf)
and I got this error:
AttributeError: 'DataFrame' object has no attribute '_get_object_id'
Any suggestion to fix the error? Or any suggestion to move spDf?
You can
putinternal Java object not a Python wrapper:and then make sure you use correct type:
but it is better to register temporary table:
and use
SQLContext.tableto read it:To convert in the opposite direction see Zeppelin: Scala Dataframe to python