I have some xml that looks like this:
<topic>
<restrictions>
<restriction id="US"/>
<restriction id="CA"/>
<restriction id="EU"/>
</restrictions>
</topic>
<topic>
<restrictions>
<restriction id="JP"/>
<restriction id="AU"/>
<restriction id="EU"/>
<restriction id="US"/>
</restrictions>
</topic>
And different iterations with the same pattern. I'm already using minidom in my script to do some other things with the xml. For the example above I need to get as result the following:
[['US','CA','EU'],['JP','AU','EU','US']]
I have tried different iterations with the incorrect result. This is my code:
from xml.dom import minidom
xmldoc = minidom.parse(path_to_file)
itemlist = xmldoc.getElementsByTagName('restrictions')
itemlist2 = xmldoc.getElementsByTagName('restriction')
restrictions=[]
for x in itemlist:
res=[]
for s in itemlist2:
res.append(s.attributes['id'].value)
restrictions.append(res)
print(restrictions)
Can you please help me to get the iteration correctly? Any help is appreciated. Thanks!
EDIT: Just realized something else might happen that I need to account for just in case. It can also happen that a topic element does not have a element at all, and when that happens, the value appended to the list should be just 0. What is an easy way to make that condition?
getElementsByTagName
returns all elements with the corresponding tag name. Soitemlist2
contains allrestriction
notes in the XML. In your code, it will add all these nodes['US','CA','EU','JP','AU','EU','US']
for eachrestrictions
node. So you should try to getrestriction
nodes for eachrestrictions
node separately in the loop.