I am trying to read xml/nested xml in pyspark using spark-xml jar.
df = sqlContext.read \
.format("com.databricks.spark.xml")\
.option("rowTag", "hierachy")\
.load("test.xml"
when I execute, data frame is not creating properly.
+--------------------+
| att|
+--------------------+
|[[1,Data,[Wrapped...|
+--------------------+
xml format I have is mentioned below :
heirarchy
should be rootTag andatt
should be rowTag asand you should get
and
schema
find more information on databricks xml