python minidom: 'NoneType' object has no attribute 'data' from url

1.1k Views Asked by At

I'm trying to parse a XML with Python using minidom. When I'm parsing a xml file from my filesystem I haven't any probnlem.

doc = minidom.parse("PATH HERE")
etiquetaDia = doc.getElementsByTagName("dia")
for dia in etiquetaDia:
    probPrecip = dia.getElementsByTagName("maxima")[0]      
    print(probPrecip.firstChild.data)

But when I try to parse a XML from a url with this code:

url = urllib2.urlopen('URL HERE') 
doc = minidom.parse(url)
etiquetaDia = doc.getElementsByTagName("dia")
for dia in etiquetaDia:
    probPrecip = dia.getElementsByTagName("maxima")[0]      
    print(probPrecip.firstChild.data)   

I have an error message enter image description here

Obviously it's the same XML in path and in url. Thanks

2

There are 2 best solutions below

4
On BEST ANSWER

Try the new urllib library instead like below. It prints out Hello. Is that what you want?

from xml.dom import minidom
from urllib import request

url = request.urlopen('http://localhost:8000/sample.xml')
doc = minidom.parse(url)
etiquetaDia = doc.getElementsByTagName("dia")
for dia in etiquetaDia:
    probPrecip = dia.getElementsByTagName("maxima")[0]
    print(probPrecip.firstChild.data)

Sample XML

<?xml version="1.0" encoding="UTF-8"?>
<dia>
    <maxima>Hello</maxima>
</dia>
1
On

The urlopen function returns an HttpResponse object. You must first call the read() method of this object to get the actual content of the response, and pass that to minidom

minidom.parse(url.read())